STUDIES OF ANGIOSPERM PHYLOGENY USING PROTEIN SEQUENCES¹

P. G. Martin² and J. M. Dowd²


ABSTRACT 


In previous papers we have reported the N-terminal 40 amino acids of the small subunit of rubisco for samples 
from four families of gymnosperms, nine families of monocotyledons, and 26 families of dicotyledons. We expanded 
this list to 122 families of dicots and derived a phylogenetic tree for all 335 species. The main computing program 
used was HENNIG86, with which a reliable result can be assured only for 17 or fewer taxa, so a major part of this 
paper is concerned with the strategy adopted to divide the 335 species and then to build the parts into an overall 
tree that is as accurate and objective as possible. Comparison with other taxonomy suggests that, at the level of 
placing genera into families, our methods give results that are at least 90% accurate. At higher taxonomic levels 
accuracy may decrease, and the result should be regarded not as a firm conclusion but as a working hypothesis for 
subsequent testing using the longer sequences from nucleic acids. Topics discussed include heterogeneity within 
species, the nature of the N-terminus of rubisco-SSU, and evidence that natural selection is powerful in determining 
amino acid sequence. The rate of evolution has been shown to vary between major taxa, and data suggest that 
angiosperms originated in the Jurassic. 


The problems of angiosperm phylogeny are well 
illustrated by a consideration of the differences 
between four classifications, all less than a decade 
old and all by highly respected and experienced 
authors. The dicotyledons are divided into six sub- 
classes by Cronquist (1981) and seven by Takh- 
tajan (1983), while, for the other two authors, the 
major groupings are superorders, Thorne (1983) 
having 19 and Dahlgren (1983) 25. The number 
of dicotyledonous orders recognized is, respective- 
ly, 58, 72, 41, and 83; these figures alone indicate 
the resulting diversities of names and content, all 
of which reflect our comparative ignorance of the 
course that evolution has taken in the angiosperms. 
In contrast to this, at the next level down the 
hierarchy, there is basic agreement about the "core" 
families to be recognized (Heywood, 1978). 

Macromolecular sequences provide taxonomic 
characters whose homology over widely diverse 
species can be assumed with some confidence. Se- 
quence data can be analyzed objectively with com- 
puters. We will probably see in the next decade 
the publication of nucleic acid sequences long and 
variable enough to solve some of the problems of 
angiosperm phylogeny (e.g., Palmer et al., 1988; 
Zimmer et al., 1989). It is therefore an appropriate 
time, when nucleic acid sequencing is supplanting 
protein sequencing, to set out the results of a de- 
cade of work that has produced 335 partial protein 
sequences from a wide range of angiosperms. These 
sequences are shorter than nucleic acid sequences 
already published and therefore contain less infor- 
mation and are less able to resolve the sequential 
divergences of early radiations. Nevertheless, we 
believe that our phylogenetic trees will indicate 
likely relationships and profitable working hypotheses 
for future investigations. 


A SUMMARY OF PUBLISHED INVESTIGATIONS 
USING PROTEIN SEQUENCES 


The pioneer of the use in botany of protein 
sequences for investigating plant phylogeny was D. 
Boulter of the University of Durham, England. 
During the 1970s, Boulter, along with his col- 
leagues and students, published 25 sequences of 
cytochrome c, 12 complete and 58 partial se- 
quences of plastocyanin and seven sequences of 


¹ We acknowledge with thanks grants from the Australian Research Grants Scheme, the Australian Research 
Council, the Missouri Botanical Garden, the Potter Foundation, the Utah Foundation, and the National Botanic 
Gardens (Canberra). We are indebted to Richard Norrish who, for many years, has kept our machinery in working 
order. As indicated in Table 2, we have obtained leaves from many sources, to all of whom we are grateful; we are 
especially grateful to the Adelaide Botanic Gardens, the Missouri Botanical Garden, the National Botanic Gardens 
(Canberra), and the Royal Botanic Gardens, Kew. 

² Department of Botany, University of Adelaide, Box 498, G.P.O., Adelaide, South Australia 5001. 


ANN. MISSOURI BOT. GARD. 78: 296-337. 1991. 


Volume 78, Number 2 
1991 


Martin & Dowd 297 
Angiosperm Phylogeny Using Protein 


Sequences 


ferredoxin. These have been collated, with references, 
by Ramshaw (1982), and Scogin (1981) 
has reviewed the results from the taxonomic point 
of view. Although this work generated much in- 
terest, it also gave rise to skepticism, some of which 
can, with hindsight, be attributed to the inadequa- 
cies of computing methods that were being devel- 
oped concurrently. The mostly unfavorable reac- 
tion of systematists, epitomized by the review of 
Cronquist (1976), influenced the cessation of re- 
search in Boulter’s laboratory about 1980. 

Before this, however, partial sequences (up to 
25 N-terminal amino acids) of the small subunit of 
ribulose-1,5-bisphosphate carboxylase/oxygenase 
(rubisco-SSU) were obtained from six species (Has- 
lett et al., 1976; Strobaek et al., 1976). This work 
led to a complete SSU sequence from spinach (Mar- 
tin, 1979), a forerunner of the work presented 
here which concerns the N-terminal 40 amino acids 
of this protein. (The complete sequencing of a 
protein requires prior purification of several frag- 
ments and is at least an order of magnitude more 
time-consuming than the direct sequencing of the 
N-terminus of the whole protein using an automatic 
sequencer.) Nucleotide sequences of rubisco-SSU 
from a few species have been published, and all of 
them have been studied using our method. The 
only new data comparable to our 334 species are 
from two closely related orchids and their hybrid 
(G. C. Martin et al., 1987). We are unaware of 
phylogenetically useful sequences of other proteins 
since those of Grund et al. (1981) and Nakano et 
al. (1981). 

Work in our laboratory has proceeded in five 
phases. In phase 1 species were chosen because 
Boulter had already published their complete se- 
quences of cytochrome c and partial sequences of 
plastocyanin. When a pattern failed to emerge from 
analyses of these data, we decided to sample each 
family with sequences from at least two more rep- 
resentative genera. Thus, the families Apiaceae, 
Asteraceae, Brassicaceae, Caprifoliaceae, Cheno- 
podiaceae, Fabaceae, Malvaceae, Poaceae, Polyg- 
onaceae, Ranunculaceae, and Solanaceae have each 
been sampled at least three times. These early 
results were published in a series of papers (Martin 
et al., 1983; Martin & Dowd, 1984a, b, c). 

The sequences for rubisco-SSU, cytochrome c, 
and plastocyanin were analyzed for these families 
by Martin, Boulter, and Penny (1985) using de- 
rived estimates of familial node sequences. Anal- 
yses of data from single macromolecules were not 
consistent with one another but, for nine of the 
families, a phylogenetic tree derived from combined 
data remained consistent when ferredoxin or 
5S-ribosomal RNA (available for some of the families) 
was added. 

This result indicated the need for longer se- 
quences and better sampling of families. Although 
rubisco-SSU was always multiply represented, in 
17 of the 33 samples of other macromolecules 
there was only a single sequence. This situation is 
precarious because, if the average distance from 
a familial node to a species is N, then on the average 
a single sequence will misrepresent the familial node 
by N. This source of error might be responsible 
for part of the poor agreement observed. Sampling 
a family at least twice, preferably from widely 
divergent representatives, should give a better 
estimate of the familial node (see phase 5). 
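
The benefit of a second, divergent representative can be illustrated with a minimal sketch. It is not the node-estimation procedure of the papers cited above: the function names, the simple three-sequence parsimony rule, and the six-residue toy sequences are all assumptions made for illustration only.

```python
# Hypothetical illustration: estimate an ancestral ("familial node") sequence
# from two family members and one outgroup. Toy data, not from Table 2.

def differences(x: str, y: str) -> int:
    """Count positions at which two aligned sequences differ."""
    if len(x) != len(y):
        raise ValueError("sequences must be aligned to the same length")
    return sum(1 for p, q in zip(x, y) if p != q)


def estimate_node(seq_a: str, seq_b: str, outgroup: str) -> str:
    """Assumed rule: keep a residue shared by both family members;
    where they disagree, keep the one that matches the outgroup;
    otherwise mark the position as unresolved ('?')."""
    if not (len(seq_a) == len(seq_b) == len(outgroup)):
        raise ValueError("sequences must be aligned to the same length")
    node = []
    for a, b, o in zip(seq_a, seq_b, outgroup):
        if a == b:
            node.append(a)
        elif o in (a, b):
            node.append(o)
        else:
            node.append("?")
    return "".join(node)


# Made-up six-residue sequences (hypothetical).
seq_a = "MKVWPT"
seq_b = "MQVWPS"
outgroup = "MQVWPT"

node = estimate_node(seq_a, seq_b, outgroup)
print(node)                      # MQVWPT
print(differences(seq_a, node))  # 1: seq_a alone would misstate the node at one site
print(differences(seq_b, node))  # 1
```

In this toy case either species taken alone departs from the estimated node at one position, whereas the pair together with an outgroup recovers it; real data would of course leave some positions unresolved.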

In phase 2 we sequenced rubisco-SSU from 11 
members of Onagraceae (Martin & Dowd, 1986a), 
15 monocotyledons (Martin & Dowd, 1986b), and 
14 species of Solanum (Martin et al., 1986). We 
reasoned that the reliability of our methods might 
be estimated by comparison with taxonomically well 
understood groups. The results were similar to oth- 
er taxonomic treatments. Additional species of As- 
teraceae were also studied and those results will be 
presented in this paper. 

To estimate the rate of evolution, Proteaceae, 
Solanaceae, Fagaceae, and Winteraceae were sam- 
pled in phase 3 using species whose ancestors are 
thought to have been separated by continental drift 
at known times. This led to a preliminary publi- 
cation (Martin & Dowd, 1984b), and the derivation 
of a molecular evolutionary clock (Martin & Dowd, 
1988), which indicated that on average one nu- 
cleotide difference arose between two diverging 
lines once in seven million years. 
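
Read literally, this calibration implies a linear conversion from observed nucleotide differences to elapsed time. The sketch below is only an arithmetic illustration under stated assumptions: a constant rate, no correction for multiple substitutions at one site, and hypothetical names (`MY_PER_DIFFERENCE`, `divergence_time_my`) that do not come from the original work.

```python
# Assumption: one nucleotide difference accumulates between two diverging
# lines every seven million years, as quoted in the text.
MY_PER_DIFFERENCE = 7.0


def divergence_time_my(nucleotide_differences: int) -> float:
    """Approximate time since divergence, in millions of years,
    for a given number of nucleotide differences between two lines."""
    if nucleotide_differences < 0:
        raise ValueError("number of differences cannot be negative")
    return nucleotide_differences * MY_PER_DIFFERENCE


# Example: ten differences between two lines.
print(divergence_time_my(10))  # 70.0
```

Because the rate of evolution is reported to vary between major taxa, such a figure is at best a rough average.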

In phase 4 we tested the hypothesis that leghe- 
moglobin had evolved in plants by lateral transfer 
from animals. This led to an investigation of all 
species for which leghemoglobin sequences had been 
published, and it was shown that the pathway of 
evolution in those species was closely parallel in 
hemoglobin and rubisco-SSU (Martin & Dowd, 
1986c), suggesting that there was no need to in- 
voke novel evolutionary processes. A consequence 
of this study was that we increased the number of 
species of Fabaceae sequenced to eight (see Group 
14 below) and obtained sequences from several 
additional families. Many of these were too small 
to be studied in the normal course of this investi- 
gation but were obtained either because they are 
known to include nitrogen-fixers or thought to be 
relatives of the legumes; these include Betulaceae, 
Casuarinaceae, Chrysobalanaceae, Coriariaceae, 
Crossosomataceae, Datiscaceae, Elaeagnaceae, 
Moringaceae, and Myricaceae. 




In phase 5 we surveyed the dicotyledons, which 
increased the number of families studied from 24 
to 124. 


A SURVEY OF THE DICOTYLEDONS 


There are about 250 families of dicots. Because 
it was impractical to sample all of them, a decision 
was made to sample about half, i.e., to increase 
the number from the 24 mentioned above to 124. 
Three families (Acanthaceae, Loranthaceae, San- 
talaceae) failed for reasons that will be discussed 
later. The additional 97 families were chosen pri- 
marily on the basis of size. The majority of families 
sampled have more than 20 genera. To cover as 
wide a range of variation as possible, some small 
families were also sampled. For example, the order 
Illiciales has only three genera, so the family Schi- 
sandraceae (two genera) was chosen to represent 
it. Only three orders are unrepresented out of 
Thorne’s 41 (two of which are parasitic and devoid 
of rubisco), 10 out of Cronquist’s 58, 19 out of 
Takhtajan’s 72, and 21 out of Dahlgren’s 85. 

It is impractical, mainly because computers are 
limited in their capacities to analyze large numbers 
of taxa simultaneously, to contemplate building a 
phylogenetic tree for 122 families (comprising 310 
species) without some subdivision into groups. We 
have done this by referring to all four current 
phylogenies. Thorne (1983) and Dahlgren (1983) 
have superorders as their major groups, the former 
nominating 19 and the latter 25. If these two 
authors agree that families are in the same super- 
order then they have been grouped together in our 
scheme, with one proviso. Takhtajan (1983) and 
Cronquist (1981) have respectively seven and six 
subclasses as their major groups, and these two 
authors have been allowed a veto; if either of them 
does not also agree that families are in the same 
subclass, then they are left ungrouped. In this 
way we have divided 102 of the studied families 
into 25 Groups, leaving 20 ungrouped because 
there is disagreement. We are reluctant to use a 
formal term like superorder but need to make it 
clear that our use of Group does have a defined 
meaning, so we have used a capital G. The Groups 
are shown in Table 1. 
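
The grouping rule can be sketched in a few lines. This is a simplified reading, not the authors' own procedure: it assumes that two families belong together only when all four systems place them in the same major taxon, so that families can be bucketed by their four placements. The family names and taxon labels (`FamA`, `T1`, and so on) are hypothetical placeholders, not real classifications.

```python
from collections import defaultdict


def group_families(placements):
    """placements maps family -> (Thorne superorder, Dahlgren superorder,
    Takhtajan subclass, Cronquist subclass).
    Families sharing all four placements form a Group; any family whose
    combination of placements is unique is left ungrouped."""
    buckets = defaultdict(list)
    for family, taxa in placements.items():
        buckets[tuple(taxa)].append(family)
    groups = sorted(sorted(m) for m in buckets.values() if len(m) > 1)
    ungrouped = sorted(m[0] for m in buckets.values() if len(m) == 1)
    return groups, ungrouped


# Hypothetical placements for four imaginary families.
placements = {
    "FamA": ("T1", "D1", "K1", "C1"),
    "FamB": ("T1", "D1", "K1", "C1"),
    "FamC": ("T1", "D1", "K2", "C1"),  # vetoed: different Takhtajan subclass
    "FamD": ("T2", "D3", "K2", "C2"),
}

groups, ungrouped = group_families(placements)
print(groups)     # [['FamA', 'FamB']]
print(ungrouped)  # ['FamC', 'FamD']
```

Bucketing by the full set of placements works because "same placement in all four systems" is an equivalence relation, so no pairwise comparison of families is needed.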

It was practicable to sample each new family 
only twice, and we have done this by choosing two 
species not only from different genera but, if pos- 
sible, from different subfamilies or tribes. Some- 
times this criterion has broken down because fresh 
leaves have not been available. 

In Table 2 the 335 species for which sequences 
are available are arranged by families and Groups, 
and their sources and sequences are given. 


BIOCHEMICAL METHODS 


The methods published by Martin and Jennings 
(1983) have stood the test of time, so, rather than 
repeat them here, a general description will be 
given and the few modifications mentioned. 

Two methods were described, one for “pungent” 
leaves with high concentrations of phenolics or 
other substances that make protein purification 
difficult, the other for “bland” species whose leaves 
are much more amenable. The bland method gives 
better quality protein and is therefore to be pre- 
ferred. However, because the pungent method works 
well with bland leaves, but not vice versa, it was 
preferred when there was doubt or too few leaves 
for trial extractions. 

Both procedures started with maceration of about 
100 g of leaves from which the midribs were removed 
if practicable. For bland leaves the extracting 
buffer was essentially a reducing, saline tris-HCl 
buffer at pH 7.4, while for pungent leaves a 
reducing, saline borate buffer at pH 8.6 and con- 
taining the detergent Triton X-100 was used. After 
crude straining and centrifugation to remove solids, 
the extract was passed through a succession of two 
liquid gel columns. A Sephadex G-25 column was 
used first to remove low molecular weight sub- 
stances. A Sepharose 6B column was used to re- 
move remaining low molecular weight substances 
and high molecular weight nucleic acids and mem- 
brane fragments. Eluting buffers were different for 
the two extraction procedures and for the different 
columns used. The protein was precipitated with 
ammonium sulfate for the bland method and with 
acetone for pungent. Procedures after the second 
column were the same for both types of leaves. 
The protein was S-carboxymethylated at pH 8.6 
to break disulphide bridges between cysteine res- 
idues and then passed through a long column of 
Sephadex G-100 in an eluting buffer containing 
sodium dodecyl sulfate. This separated the large 
subunit from the small subunit, which was precip- 
itated in acetone and dried before sequencing. (A 
variation of this procedure was to use a column of 
G-75 followed by G-100.) 

The methods are rather crude but are successful 
because rubisco is a very large protein and, by a 
considerable margin, the most abundant protein in 
leaves. 

About 5 mg of small subunit (in 0.5 ml of water 
without polybrene) was sequenced on the Beckman 
890C automatic sequencer using Beckman's standard 
quadrol program with 50% quadrol buffer. 
The phenylthiohydantoin (PTH) derivatives of the 
amino acids were identified using a Waters HPLC 
instrument with a C-18 radially compressed column 
and eluted with 0.1 M sodium acetate (pH 6.0) and 
acetonitrile. This did not distinguish two pairs of 
amino acids and was therefore supplemented with 
TLC. 

Using these methods, we could, without assistance, 
produce two proteins each week and sequence 
two others. 


TABLE 1. Families of dicotyledons grouped because they are placed in the same major taxon by all of Cronquist 
(1981), Dahlgren (1983), Takhtajan (1983), and Thorne (1983). 

GROUP 1: Magnoli, Winter, Annon, Myristic, Schisandr, Monimi, Laur, Aristoloch, Calycanth 
GROUP 2: Berberid, Ranuncul, Lardizabal, Menisperm, Papaver 
GROUP 3: Cabomb, Nymphae 
GROUP 4: Ulm, Mor, Urtic 
GROUP 5: Hamamelid, Betul, Fag, Casuarin 
GROUP 6: Dilleni, Thea, Ochn, Clusi 
GROUP 7: Myric, Jugland 
GROUP 8: Caryophyll, Nyctagin, Amaranth, Phytolacc, Chenopodi 
GROUP 9: Dipterocarp, Elaeocarp, Tili, Sterculi, Bombac, Malv 
GROUP 10: Viol, Flacourti, Datisc, Cucurbit, Salic, Cappar, Brassic, Resed, Moring 
GROUP 11: Sapot, Styrac, Primul, Myrsin 
GROUP 12: Eric, Epacrid 
GROUP 13: Cunoni, Ros, Saxifrag 
GROUP 14: Caesalpini, Mimos, Papilioni 
GROUP 15: Trap, Lyth, Myrt, Punic, Onagr, Melastomat, Combret 
GROUP 16: Olac, Celastr 
GROUP 17: Connar, Sapind, Anacardi, Simaroub, Meli, Rut 
GROUP 18: Halorag, Rhizophor 
GROUP 19: Zygophyll, Gerani, Tropaeoli, Malpighi 
GROUP 20: Logani, Gentian, Apocyn, Asclepiad, Ole, Rubi 
GROUP 21: Lami, Verben 
GROUP 22: Solan, Convolvul, Polemoni 
GROUP 23: Scrophulari, Gesneri, Bignoni, Pedali 
GROUP 24: Valerian, Caprifol 
GROUP 25: Api, Arali 

FAMILIES THAT DO NOT FIT INTO ONE OF THE GROUPS: Aster, Bux, Campanul, Chrysobalan, Coriari, Crossosomat, 
Elaeagn, Euphorbi, Goodeni, Hydrophyll, Lecythid, Loas, Nelumbon, Piper, Plumbagin, Polygon, Prote, Rhamn, 
Thymelae, Vit 

Note: "-aceae" omitted from all names. 


FAILURES 


Although 90% of attempts led to successful sequences, 
the remaining 10% deserve brief attention. 
Unless there was an identified reason for failure 
that could be corrected, our policy was to try 
another representative of the family. 

Faults that could be corrected include the 
amounts of extraction and elution buffers used. 
Some plants gave extracts that were mucilaginous 
to the point of setting solid. Dilution of the extract 
corrected this. This problem occurred in Onagraceae 
and a few others with small leaves containing 
a high proportion of veins. Insolubility of the pro- 
tein, leading to precipitation in columns, could 
sometimes be corrected by loading a more dilute 
extract. Plants with C4 photosynthesis, and rubisco 
tightly bound in bundle sheaths, were avoided if 
possible. Plants with C3 photosynthesis often occur 
in the same genera or families and were unlikely 
to be phylogenetically biased. However, if unavoid- 
able (e.g., Welwitschia is reported to be C4), spe- 
cial care was taken during the maceration process. 

It is suspected that the most common cause of 
failure was the presence of powerful proteases in 
the leaves and, in retrospect, it would have been 
profitable to try correcting this with research early 
in the project. Species of Ficus, known to have 
leaf proteases, showed symptoms of this failure. 
Large amounts of protein traveled where the small 
subunit should have been on the G-100 column 
and gave many amino acids at each position when 
sequenced. Another casualty of this sort was Gne- 
tum gnemon, which was particularly desired be- 
cause it is a gymnosperm thought to be close to 


TABLE 2. Species studied and sequences. 

[Table body (species, sources, and N-terminal sequences, arranged by family and Group) not reproduced.] 

Notes: 
(1) The dicotyledons are arranged by Groups as explained in the text and Table 1. After each family name we give the 
three-letter abbreviation of Weber (1982). 
(2) After the species name and author, a slash is followed by the source of the specimen: P.G. = private garden (in 
Adelaide unless otherwise stated); B.G. = Botanic(al) Garden(s). Australian states: N.S.W. = New South Wales; 
N.T. = Northern Territory; Qld = Queensland; S.A. = South Australia; Vic. = Victoria; W.A. = Western Australia. 
N.Z. = New Zealand. C.S.I.R.O. = Commonwealth Scientific and Industrial Research Organisation; c.s. = certified seed. 
(3) Key to N-terminal 40 amino acids: A, alanine; D, aspartic acid; E, glutamic acid; F, phenylalanine; G, glycine; 
I, isoleucine; K, lysine; L, leucine; M, methionine; N, asparagine; P, proline; Q, glutamine; R, arginine; S, serine; 
T, threonine; V, valine; W, tryptophan; Y, tyrosine. 
dAMDONHTITASASMTTÓSLSTddT1A4SqT1LS344NX458ddMANd "5'd TINOSSTH/*D0 suebe[e eAney 
IAMONSITAZAINITO 30/11 710/dd TASTLIIANTIO Id MAN J "9"8 epPrelepy/'Aei5:v 3 ‘wlebugq TISWTSYPUTT eineo 
dAMS9NS"ITASAS33ITÓSÓLTGGTASTLSA4UMNETSOSGddMANÀ '5'd/S$0A4 eprQiqAu etsyong 
GAMINUTITAAGATAITOTALTIATASTLAAUYNTORAAMANA “O° d/*FeY uneto unrqorrda 
GAMONUTTAGAGTWOTOCAYLTAId TASTLAANYNIOTAAMANAG `9'g SpTeTepy/usesey (SUSSID) unueo unrqorrda 
GAMOMNTTXGATWITOGCYLIT4dd TASTLILAAYNTIOAAGAMANG *9*d/*TPUTT P3eTnoTnóun erxieTo 
dAMNNE'ITAGAS334ITÓÉGLLITdGTASTLS4UNMNTSTd4dMANÀ "D'd/STMeT] 5 stuet pur epunorqnai eTyTeTO 
dAMDNHNuTITASAS34ITÓGÓÉLTIGTASTLSAJNNTS5SSGddMANÀ '9'H TINOSSTH/STÁOY PIPPIO) ?eoil2 
WNO AVADVASWNO 
GAMONMMITAGCAGTAYVTOAUTSSTLAGTASTLAANNAOLILAAMAOR 'S'd/uepreW(ueprew) edzeoorotu snqdATeong 
GAMOSTITAGAATAVYTOTASTLGTASTLAANNAOLAAMAOHRH og epre[epv/Aized 3 “IEW (JeTTOg) TTYITUS eueuoy 
LUW JVAOWILYUAN 
@@IMOSUMITAWIOWITOGTSSTAGTASTLAAUYNANATAAMAON "9*a/*ubo9(*9d) LULƏTTTAIN PUTYINOQTI 
dIMDNNITAVIONTTOASSTAEdATASTLIAIANANNIdadaMmaAodn "'M'S'N Qq32oN/uoq:iq eurgge PUOIJSPTON 
SIM SvaovIVWOLSVISW 
dAM5M'ITITAGAS3MVTOÓLILTGdT1ASUILS4NNM3ST1a4dMAOR "9'd 9epPreTepv/zanyg(*T) eseoor3onig eTpiogpooM 
dAMS5MN'ITTAHAG3NT1T793SITddTASTLS4JXNGETSOSdGdMAOR '9'd ePIeTepv/*T erTivor[es unriqaAT 
LAT IVIOVIHLAT 
G@AMONSTTAGATATTOOSSTASTASTLAAYUNNOTAAMAYUN '9'H TINOSSTA/*T POTPUT ettenbernn 
GAMHUNWYTTAGATATIOITASTASTASTLAGANNNOTAAMAYUR "o'd 9PT*TePV/'qxoy unrpugoep unjeiquo) 
SNO sV3OVLSSSNOO 
ST dünOu5 
S3AMSONSTTKCQARXSTOSILITCQRaTASTLIHaXMXSOISaGaMADON `S`O/X9ZƏTTM('T) PIPTPRI PUÉTA 
GAMONMUTTATATATIOGM/LITdd TASTLAANYANOTAAMAON “S°O/°T Pez] BTOTA 
GAMONMUTTATAGATTOGCALIT&I Md TASTLAANUNAOLTAAMAON “S*9/*T wnat jes unstd 
GIMONMTTTITATAGAVTIFTdsTaddTASTLAANUANIAAMAONH "9'd ezzəqueO/əonIq(-quəA) unqeloesoSšueT wntqoTAxo 
dAMSONHUITAGASNVTSSLILTGdTAS'ILISA/J48N TOA/Ia a MA YN `S`O/ `T snrTograsnbue snurdnT 
dIMS9N'TITTASAS3MTITSS3SST1GdT1A4ST1LSAXMNTSIGdGMANMA "D P119qUeD/*UUND*Y PTTOJTINDR veAOH 
dIMS9NTITTASAS3MNMVTS3S311Ll1T1GdT71ASTLS4JXX4XNST1d4dMAOR "9'd *I1equeo/*qsT[eS PTTOJTIOT *Tpoo5 


'penunuo) *z 318v] 


Annals of the 


306 


Botanical Garden 


issouri 


M 


nn + + + o eee o 


d 


d 


Du A Du 
H>H 
zzz 
O x x 
man 
HHA 
HHH 
HAA 
> > > 
mano 
>>> 
9 a n 
x x x 
«og 
HAA 
< O 
9 n) nl 
nn 
n A Oo 
AHH 
Du Ay Du 
Du A Du 
AHA 
= > > 
ann 
HAA 
E EA E^ 
Ë] p) n) 
fay És Du 
> xx 
x x x 
x, xx 
000 
HAH 
A Ay A 
Qu Du Du 
=== 
>>> 


I 


A 


M 


M 


z= 


= = 


oo 


== 


x 


5 


s 


s 


1 


1 


T 


1 


AH 


HA 


T 


T 


A 


A 


q 


a 


I 


A 


a 


a 


XA 


3 
3 


x x 
< < 


T 


Y 
Y 


< < n 


HAA 


1 


T 


OO a OO a 


a o 


RI 


BIO mm 


E O n 


a 


a 


q 


L 


L 


L 


1 


I 


a 


a 


d 


d 


“E 


I 


A 


A 


s 


s 


I 


T 


L 


L 


3 


3 


1 


9 


N 


oo 


00 


OD OO 


" 


A 


d 


d 


d 


d 


= = x 


A 


A 


3 


Eo * nm oz a x Oo x 


OM 


a mM Oo MO OM 


x x 


KW “vs org A3307 3R/edoH esoqoTS etetppng 
SOT UWAOVINYSOT 

W "5'g epteTepy/reAen’g suedseqnzy unTud:Io 
W :9*4/U00*9 unuerrTəssnr: PWOJSNZ 
N35 AIVIDVUNVILNIO 

W '5'g/'aig'u esouieo eAoy 
SW IVIDVIVIATIOSV 

H '5'g SPTeTepy/*T 1ouTu POUTA 
W ag eprerepv/uospooM(UOAed ? ZTMY) erer PITTASPURH 
odv IVIDYUNADO dW 

oz dnOu5 


H *'M'S'N PTeuerTeg/*TTONN’ A unaeTnorde wnt TAydobAzZ 
DAZ IVIDVITAHIODAZ 

K "erg epre[epv/'1 snCeu unToeedoz1] 
dul AVIDWIOIVAIOYL 

A'9"8 eprertepv/'ssnp:ipv(-ueT) un3erTro uoTTAydeubt IS 
R *5'8 eprgtepg/ It ezqeTS erubrdTen 
d'IN dV30VIHSIdIVN 

"erg epreqQepy/uoarV( `T ) unaerirnono unruobie[ed 
:*0*4/*19H,T unjeuo»sou unrpoig 

was EVKRVALKCKG 

61 dfnou5 


mx 


W "AH urMieq/-iiew('inoT) PIPTUYOPIG eTTTeIeD 
HR `L`N uTMzeq/noy Burg eqeqstzexe eiaernbnig 
W APMPIPS 159104 DUTYINN/ “TITO eyotastp ee TTAydostuy 
ZHU AJVJOVUOHdOZIHA 
"erg erzəqueo 

W /Pa*eqozo:g*'V('TTenW*4) rTue[ieneq uorpuepobez0T eH 
W "erg eiiequej/pieuoio'g^vV sntTAydoezo sndIeoouos 
TWH sN 3OVSOVSOTVH 

81 dünOu5 


Ww "D'a eprgtepg/ TUJ TTSUTTM PTATTH 
W "og eprerepv/eiburMS (ISTITH) PUTSSTITO snyqueTty 
UWS aWaoOvanouwWiIs 
H'9*8 eprtetapg/ Tram a xe uououeTd PInpued PTTINAIPH 
W ‘ota usAnog/*boer esoosta eeruopog 
dvs IVIDVANIAVS 
W '5'g SPTRTOSPY/ÁSTTRE PUTTTOS eTserepuT Ta 
W *5*a/3əəmns xe Áexowey eTTeyotnd eezzo) 
W "ag epreTepv/'qunup] esuedeo unripuepo[e) 
LOY AVIODVLOY 
H :9"8 epre[epv/;s$np^T'V STSUSUTS eqəIrpəI 
TH IVIMNITIH 
W PTO*N/*TITOSNMA* IZ $ndaeooqouoo sni?uuo) 
W erseuopuI'5'*g 10b0g/*119N (SOUBTT) sSTAIƏUTI3 eee[eby 
NNO dqv3ovVHVNNOO 
W *'9'g əpTəeTəpV/ jJseq Pot jueT Ie VTOLRJSTA 


xI n . |F? 


'penuguo) 'Z 318V], 


Volume 78, Number 2 Martin & Dowd 307 


1991 Angiosperm Phylogeny Using Protein 
Sequences 
D DD A f DD DD ma Qu Qu Qu Qu Pu Di Au Du Pu Ou Du Qu Ou Ou Au O Ou Ou Oi D: O, Oi Os Aa 
H HH H> HH HH >> H> >>> >>> >>> E PNP GG 
= == Ss HA ud zz zz sz=z= =E E: xE E Z= RRE === === === 
O xx O00 zz MM CO oo QOO oO O O O QO Q OQ O (Q (Q (O QO (O QO (O (O O QO O (O 0 0 
Zon St MM MO MM MH zz z = Z Z Z Z x Z Q Q Q Z Z Z Q Q Q Q Z oo 
q dd nun MH AA AZ MA AMMMM MMMM SM AMM MAM 06 93 OG m xú m m 
d HH HH HA AA HH HH ü AAAAAAAAAAAAAAHHAHAAAAHH 
q AA ud dd aud AH AH HHHdüHdHHHuüd8H84ddddg48dd48d888828d43 
> >> >> Sa Eu fu» Sab Sab > 4 Da DA 4 4 DH DA 4 4 P+ P+ P+ 4 4 aa abba Pn a ba + 
A aa ma mui An on oo AAA AAA ee AaAAgRAAAAAAAMARDA 
> >> HH >> >> >> >> FUEL EA er eret pe c Melen EY ERO FAL DECR 
D] HH PF m mo mo OO PPO tM m m m al PB) p) Ë) B PJ B PQ OO BJ P D) PB) 3 
Mo MM MM MM MM MM MM X0 B d d AAA MHRA < 3 3 AM RM 
A HH HH WH dd AH 44 udddHüHHHHdü88848u8uddHdHuuudd 
d AR da HH uad AH HH HHHHüHdHHHHuHB8BBHHdddüddddddudgd 
a aa oo “a ay oo ga AAAA A OA A A A A A A OA A A OA OA OA O OA OA A A Y 
<< Bw aa Hea ma ma ob Pl Pl Pl Fl Fd a Bl Dl dad pl BJ PJ [3] ada pa [a] ddd 
m om EE no wh HH AA dHAAAAAAAAAAAAAVAAAAADRAADAA 
BH OH Be HO Oe AND HH DHNNNNANANHHNNNHRNNNHNHAHHNHNN 
A dd AA Ha ua HA AH HHdüddüHdHHuddddlsldgdgdg8tfsfafsdddd 
H AA ama aa AA MAA ma ANAADAAARAAAAARAHAAAAZHAAAAH 
D MA wo DD Ge Ge LA QA ` Gu Qu n n Qu Qu Du Du -V DD D Ge De Dn o o Dp Ge Ge x Du 
M H8 AA aA vA AMY AA uHuHdHH8BHH/HHHHnddd48düun8d»88u dag 
4 Bx Sa Sa >> >> HH WA AAA 4 DH A n A DH + A DA 4 4 + A A A DH Pr A a Sa 
o nn ON nn nu AND ANH mooooouoomooooooooooooomomoo 
9 AA vA dd AA HA AH düH4dBHdHHdHH fW eee d by 
E Be Et He EA EN FE EE EFEEEEEEEEEEEE EE EH E^ EA. Et Fr E4 EM E4 E^ E^ 
a aa aa mnm mm HR HN gà) B) BJ BJ BJ Dl PJ PJ P [à] P B D fe] B? [s] [à] 9 D [3] [s] P fil RN 
L Ml mo las fa [xa Ds kb Bb os pe DA D DA D Da DA e e e P Da Da 
MO MM MM o MM MM SM b MD hú M Mi hú Ne bú MG bú MG bú bú bú Mi bd bú yú 
x MM b x i xo xx Bee E leiar sasoe ereke 
H AHA AM vd TE og no == xx Z Z Z x Ü 4 MOM MOMOM MOM M oxox E xo 
o 00 ou O O O os vo z = Z Z Z Z Z Z Z: O Z; O — Z Z: Z: Z; S O Z; Z Z z; O 
El Hd Fo B ni DI nj DH HH ERES E ed es bs Es AAA EA pt pq. ET SS 
e on Dé Ho He Dë mna DG nm D Dy De Oe D Dr D De Q Qu ñ ñ, Du Qu n ñ n n n n. x 
D Gef Dë D D an Q« Qu DD Qi D hn De Dr Oe De De De Du Dy Oy Ay Be Ae OY Ay DD D n Ge D Ge 
= ss EZ zz oru zz ss zz = E t= = i= Iz iz D Iz Iz I I E Tm mx x 
5. Sep sq» >> >> >> >> >>> >>> >>> >>> 
a xx xa Mox MM AA Äx QOQOQOQO QO QO QO QO QO O QO QO QO QO O O O O< O O O O O O 
= == xx zz = eee 
o E 
m z 
h x CS 
ks] Di . E zs 
oO e > en z "i 
` ei D OO D A 
° m o9 KI MC n c '0 
o . k 8 e e D d 
. . v 9 < 39 = Da 
Du . 0 . ke] Tas ri od Q U 
Kä Lo DËS m Es! H H n H U) -A 
. H - m gi o9 £s o 33 Ga "ve 
iu ae — Q4 d e See? ¿ga ee 
5 ei 33 "o0 dé o ge Hoe O-d o0 Du eil ei H 
X 0. ae a NA fu 0 EE vod a EG O 
° OU g — om e M Sa E Kal Du el E enog . 
fas A >» ao kel E 9 o KE WK RK Bek . AMET * A 
~ wm oa ao da ‘HO OONN NA . O Hd on 
. a v e CU Gea wo Iu ue uds ‘ORAU e Ores. s] 
+ do ar 9-4 UN O . S $w4u5.o0b5804-m. J391&0.nm 
3 Ai 55 34 an Be 396 Sita ESET Eee SS 34.2558 
io |z Swa E e 30. 59 BE omo ASNO E 00200 T Tap m 35m HÀ 
g an -4m 0H x G m z < BOSSA DIO S£ Bn TNO ome is 
s [3332225 5588825 6222: ELLE CE ERC PERS eS aS 
E H ai So: eu Ds X A PH gë gAH d KE SEJNE ES 
`= n oN e D CA A-A CG 3 KEE ga Ae 03 E EA 
= "ood: H E hx na o B9S8959Rd222,498582959252922292 
SG ae E 8 PRA EEE EE PEE REE 
© & oa EK Ë ° > 3 ç 8 ARTO DANOS BOLDER aD 
d 00 Q3 508,52 ñ e Hd 92 BuHOSZ2ZpsSHOROHDOHUOMUASu 
H Di SW wu o. n ei [e] On @ GO n sun DV Qe A G O Hedi Drei D N-A O MH 
ed o gg HD EE 5 2084840 EEEE ELEREN EEE 
* > Zr 3 MEE EEE REE EI Go PERE a 
a 
sl 23304823 No Sean SEE ADSHndadaddddaEEEEEEEEEEEEEE Q 
ca am GOoOOg O-q-AZH SCEERCEEEEKEEEEEEEEEEEEEEEKEEEER 
< COA «oo K Ë Hü qx e 9 G o E ox SKkkkRrK KK EEEEEEEEKEEEKEKEERNEN 
= roo Dnood.so 255 SEEEGEKFEKEKEEEELEEEEELECEEERG 
BES Ps 035553 E EE gi d 
A =: Ñ Mq. 


308 Annals of the 
Missouri Botanical Garden 


Du Du Du Du Du ` Du Ou Du Du Du Du Qu Du Du Qi Ay A Ay Ay Ay Ay Ay n. Du 
HH HH HH PH >>> >> >>> > > Pb pp >p >> 
SS AH SS FH === FE zzz = E = = x x= x x= x= 
x x nu xn x x 000 x ox 000 o o tá bá RG MD b i x 
x x D vd zz aA MEE zz nnn D o z, Z, o Z Z x Z= 
HH HH HH n OG bf HH KAE x x Q >p Q ú a > x 
Hd "dd HH HH HAHA HA AABK D d dd dd dd 
Mu AA AA Hd AHHH ARH HAA d a dd dd dd d 
> > > > > > >> mn nm > PA Dn > D »^ pA > pA pA pA > > 
mE 44 ano AA aaa AA aon a a aAaanaaaa 
HH HP HH HP» >>> HH PPP > > >>> > >>> 
53 n Di Bl A fl El fe] 5) nm bb 5 5 n hi El Ë) BJ Y Y B Y E 
ix x x x x be x x Kb be ^ oM bb Ei Mm `< BG ORG b bá > be be 
HA HH HH au < < e DO < < < < “< Seege rue e 
HH HH HAH n AHH Hd AAA 4 A n=» 
OO aa B o aa i) < Q aa Qoo a a End Oo O O < 
a A Al fl El Ë) El < < e 5 nj Ë) ml m fl tl B] « aaa 
nn nn nn Pa omo AL BREA P P SMS Q WO 
Ao oH HH Pi E4 nnn RE HHH D D E oO oo 0 E40 
AH Mud HA AA AHH n AHA A A AAA Ana 
00 ma AA aa D Q0 D Du De Du pu D A A Q Ei Gu Du Du D 
D Du OQ DD AA D Du Du Aa Dy aaa D A As Ou Ay Du Du Du fn 
Hd AA Hd HH AAA Hd HHA A H AAA” HAHAH 
HH »4 » Di nm p» ba P4 An HH P4 P4 P4 » Ca PM P4 P4 ba pA Sa > 
an nn oo AN moo nu ooo o Y Q Q Q Q t m o 
AA AA AH HH AAA HH AHA d d AHAHAAHAAA 
E E E En E E E* E4 E4 E4 E SS) E E E4 E4 E4 E4 E4 E4 E4 
bl 5i fx) pil bd nl w Bl l ni fe] El Bl p) nm El D Ë) Y Y Bl B) E El 
iu fy fa fu [s Da > > fy Du Du bi > fa ba [u Du Du P4 PA Pd PA pd pA > 
x x< x x x x > x x x be x x x x x x< x KKK e x. be 
x x od x x x x x x x ix be x< x x< x x< há bá há bá D bo be 
AA AA A AA AAA EM aa vd A dd dd MAH 
00 00 oo oo 000 = =, = = =, o o 0000000 
>> A> H a H> HAA HH HH D> d A HAAAY 
HH HH BA DP ana mam DOO D A Py Q As Ay Ay Ay De 
an DD ao AA Du De Du Qu UUA A A O A As Ou Du Du D 
= = = = = = == === = = = x= x = = = = = = = = = 
> > >> >> >> > > > >> >>> > > GGG 
== be be OO OH x x x x x ox x x x x x x x bá x be x < 
= = = = = = = = = = = = = = x= x= = = = = = = = = = 


Adelaide B.G. 


á o 
m m 
o 3 
ks] d 
; a š ; LSS 
jd - = Xs ^3 
m . v m n a < 
of Š < S a o o0 
$5 o à * 9 d É A 
n ei EN 9 d kel a H m m 
d e Kk E 8 $ E o o 
ses E A - xz i31 
KEE: Du m o [^] zx t s ~ 9 vi . oO © 
NA ~ ~ = d * YN E E v o a a 
65 3” 8 i P Eo < 9 m a 3 
e E EE E ae E et 
TEA E WEEN cdo 
3 os Fama 8.298343 TEITE y ^J. 3 eS aS 8 
E Ba 88 Agos BAe SoG Rida E g4qddgddg 
-— u n on 9 o O». +» H G H Se ° Ay GME uu 
i= -A F] E o° A H A HHH d na 0 G D vA DAMA DA 
3 an GA oa Be äi 6 M 27 co» d d O VAT c5 
eg OO HO pe 43283 Be O AH ei o Y Kx A >H 0 O CG A 
d$ pé ^ “3823 SEEERE Pe "PN. d ce Z EEEE EE 
3 gn “S H A Hd ei no o n o O d o + 
N à TE oe &oc»o2 > E a w Ed ana 
13 3 GREEN H aa G o N mda e Sing oe 
a Seen woes qO 525464 qdods0 A AAA Aca 
e ZSuHmecHOgzss u gD EH gd 8 Ode Y S$S9HOdH$g9 
OHO aa wHo oes M- K H H Du E d el H ei W H G O j CG QO G O 
= BSPTRESES IICA a 1232 PO De äeudsg 9 HAGY AQAP 
Bii aaa sshSe Bllizrir 5525582 š EEFE 
asatrna donna B6aasS58 ELLA a Arddadóza 


MKVWPPLGLKKFETLSYLPPLSDEELALEVDYLLLSK 


MKVWPPLGLKKFETLSYLPTLSESQLAKEVDYLLSNKWVP 
MQVWPPLGLKKFETLSYLPPLSSEELAKEVDYLLLSK 


MOVWPPVGSKYFETLSYLPPLTTEQLLAEIDYLLLSNWVP 
MOVWPPLGSKKFETLSYLPPLSPESLAKEVEYLLLNGWIP 


BUX 


Senecio petasitis (Sims)DC./Adelaide B.G. 
Buxus sempervirens L./Adelaide B.G. 


Simmondsia chinensis (Link)Schneider/P.G. 


Canarina canariensis L./Adelaide B.G. 


BUXACEAE s.l. 
CAMPANULACEAE 
Lobelia erinus L./P.G. 


309 


Angiosperm Phylogeny Using Protein 


Martin & Dowd 
Sequences 


Volume 78, Number 2 
1991 


TAMONITTANIOWTTGVASTaAAaATE STLGUAWIWOGA 
GAMONITTIANIOWITITIVASTaAaATE STLIAJNTNDIa 
IAMDNATTANADOVITONISTAG TASTLIGAWNWUWOSG 
dIMLNSTTASISNVTV3SST1adad TASTLSA4JMNAMS5S3d 
dAMLHSITAVIS3NVTV3SST1ddT2 STLGUAWWATIAA 


GAMUNITTAGAGYVIOALITaAaTAST LSAMNMTNIG 
dIMXNNITITA2GIÓMNVITÓSLITGGTAS] LAN NNN Gd 
IAMANTTTACIANVTIOAOLSTA4Adras TL XX TOL d 
dAMXNN'ITTAGASMVITI/3839014a41A4S 1L AaAUUNTOTA 
GAMONTTTIXAAAATSYTSALITaatist L 32 3 X NS ST/Ad 
G@AMOSTITAGTAAGAYVITSGastLatisy L 3 3 X 4S/19 à L 
G@AMONTTTAFTAOSYISAZaALIGatTtisy LIJANS9Ad 
G@AMONTTTAGAGAYVISALLILTtaatisty LL IAIJMNNMDAA 
IAMDNUTTACAZNVITIDOALLTAaAdTAsari "44XNNNO5Ad 


dAMDMu'TITTAGQIS35TS3S3GIlTddT12AS LSa4x3T15'T1d 


dAMDONSTITAGAS3X3MVTIS3VSSTdd4T14S$141 LSAXNMNTSIGd 


>> 
n 
x 
< 
AH 
aa 
E 
BE 
HAH 
Du Du 
oo 
AH 
E E4 


A MN 
4d X 


A 
> 
= 
o 
n 
d 
a 
d 
> 
aa 
Du A 
A 
= > 


` A; Du 
=== 
>>> 


Du Du Ou 
=== 
>>> 


O OO 
== = 


O x al 
= = = 


'9'd/*'1 snrTogTsnaqo xeuny 
"9*d/*T wnotquodeyz uneyy 
*S*9/yo0uesoy unquernose wnz Adobe 
5rd IVIDVNODXATOd 


'5'8 SpTeTepy/ ‘wey ejernorine obequntg 
:9*8 SPTRTOSPY/9ZJUNN TTUTTOAWÉ umruourT 
SId ` IVIOVNIOVEHOTA 


"99 SpteTepy/*T unibru zedta 
:9*4/y3uny eAz30qAtod etworedeg 
dId avaovisdId 


"95'a epreTepy/ieuiieeo PISJTINU OQUNTON 
TIN SV3oVNOSHRO'TSN 


'S'o/Aei9'q 3 KAeaioj rAe[pur[ erteziuen 
WOT AVAOWSVOT 


TTeMeH unjeaioqaiy uoÁT/5zəg trryod s1yi40e7 
TTemeH unjezroqiy UCAT/JeTQny srsueuernb ej3rdnoinoo 
KOT AVAOVGIHLADTT 


"S'O/*YqUeg er[ogT3e»?ue3 eTTeoRYg 
'S'9/*UXN 3 'XooH TTseTzuew e[Tudouew 
GAH — 3VSOVTIAHdOWIAH 


"W'S 1Te[eg/eoniq(u3TuS) PPTATY e[oAeeog 
‘ws "og Á3JO0T 3W/U3TuS e3eAo PTUSPOOS) 
qos IVIDWINICOOS 


'D'd/'T srunuuoo SNUTITY 

'5'g8 epre[epy/'biv ‘TTenw TTpueurpieg uorpruooto 
'5'gd SPTRTOPY/*B1Y ren euersexy[r^ eudATeoy 
ana #V3OSVIguoHganq 


:9*d/* 3308 eequedre etpzeydeus 
'9'H epreTepv/'qunup suebund snubeee[a 
sI AVIDUNIVIVIE 


‘D'a Buy eques OYOUPY/*IINN POTUIOJTTRO PUOSOSSOID 
OY IVIDWIYNHOSOSSOYO 


‘D'a SUINOQTOSH/YITTTOM sTsueTedeu erierioo 
wad SV3oVINVINHOO 


‘L'N UTMIeQ OHISO/'TlenR'4 epuou TreUTIeg 
Tre^eH unjeioqiv uoÁT/'] ooeot snue[eqosAiuo 
SHO IVIDUNWIVEOSAYHO 


——À——— d—À—RÓ APA A Ann eee eee ee S 
e—a 


'penunuo) `Z 318v] 


Missouri Botanical Garden 


Annals of the 


310 


+ + ++ + + + nn e o O O eee 


dIiIMSSTIITAG 


d 
d 


Qu Pu Du 


Da Ay Du 
> >> 
= == 
OD OO 
bb x 
“HA 
AHA 
AHH 
P4 > > 
AzA 
> H> 
a a a 
x x x 


Py Ay Ay Oy Du Ay Ay Du 


>> 


>>> 
=== 
ia x Ó 
M x x 
Hf d 
AHH 
AAA 
94 Dn > 
aaa 
>>> 
Bl El El 


GGG SS 


M 
M 


== 


= = 


== 


= = = = = z = = 
vovvvvvx 


H 
9 


s 
s 


ix Z; i£ bb be k 


1 
T 


470404700 


I 
1 


AA AAA?” 
AAHAAHAAA 


1 
T 


A 
x 


9M PM bn DM nu DA A > 


a 
a 


azondanounun 


I 


H 
aa a 
x x< 


no 


GGG SG 


OO 
x x 


m n 


E) pl pm) pm) A p) pn) bi 
BG M MS b b be e 


pi 


“< e < 
HAA 


se eebe ee mum 


dd 
HH 


HA 
aa 


a a aZ 


H q El 
D Ou Y 
D Ei o 
AHH 
Du Ay Oy 


HAA 
= Qm 
a ua 
nnn 
NHN 
Maud 
Py Ay Du 
A Dun 
HAA 
nm > 
nnn 
AHH 
BRR 
RK 
> ba Eu 
x oM x 
x x x 


HAAAAAAA 
< O < O OR O n 
Cad fX) p P3 pl p) p) nl 


I 


Du Dy D à, à, 0 00 


S 


nn En F. @ o 


1 


Pdumnuüunuuüud 


d 


Ay Ay Ay As pn non 


d 


d D DD D UY Ds n. Ay 


E 


nn 7777” 


A 


94 P4 > 
nnn 
AHH 


P4 bn A A bn 4 bn 4 


s 


NNN 
HAAA AAnRnA 
E E* E E E* HE 


T 


L 


a 


a p) p) (5) pl pl p) pl 


a aa 


> lu lu 
x x x 
vuv 
O: Du Du 
Di Du Du 
SS 
>>> 
O x al 
=== 


Q 0 Ó 
Bux 
Ay Du Du 
Pu Du Du 
=== 
>>> 


tá oM b x. 2 x 
KR XX 
D oo oO 
dudd Gs 
BEI DEE 
O x O iG Oe OO Ob 


B 


m 


o 
SG SS GG SS 


Du Da >+ Ba ba ba Da 0 
há bá bá bá bá t< i. e 
mx, 

be 

z 

ma 

Du ` Du Ay Du Ay Du Du 
= Ë t= tz tz tz tz Z 


"org epre[epv/'T snaeo[eg snbeiedsy 
111 “T'S IVIDWITIT 
W *9'*d/Ə9UĽH IO[OO$TP OSOUyN 
W 'D'd/'PTTITM $T3SeTeoo eurTeunuo) 
WWO JVAOVNITAIWNOD 

*'D'd ƏPTeTƏPE 
H /`TpuəsM(KIog) suəosəqnI sndrIeooprTesKIuo 
H "erg SpTeTepy/uosjem’s ejewre eeueig 
cti KSC 
N '9'd/a33ouos uməque5r5 uozpuəpolrud 
WR *'9'd/*'1 unje[noweu unrzv 
way aWaowdv 
WwW ‘Y'S IOTAN/*3*1 soAyoeqstp uojabouvody 
SdV IVIDUNOLIDONOdVY 
W "erg epTeTepy/xneyotW POUFUBIÓ etzeqqtbes 
ITV AWGOVLIWASI TW 


SNOQGZTALOIDONON 


W "D'a 9PIET9PV/''I PISJTUTA STITA 
W *)*q/uoyouetTg errozenburnb snsstooueyqieg 
LIA AWFOWLIA 


N *)'d/*yooH seposAyd eəTəurd 
R "5'd/'quaupj ezopo euudeq 
AHL IVIDVIVÍANAHL 


:9*4/*T €e»rai*eq3eo2 snuweyy 
'5'g epre[epv/'u»suosg SNIOTITSIAYA snyjoures 
WHY AVIDVUNAVHY 


zz 


“Z°N "org TIBIO 

/sbbtag'g 3 UOSUYOC*T(“UUND*Y) nzoi PTUOTIOL 
:0*g PITequeD/TeSeYyD srsueebuou eedorTej] 
"5'g/'*1(^1) suedez eejolg 

*5'g ?eiiequeo 
/zeusstey xe Áxs3ou] eonedewueugo eruoosied 
"95'a 9pPIeT9ePV/'TIenW^4 vTTOgTUIe3 PTUPPEIPN 
"90 "org 3singexeM/'ad^W v3e3uep eT3*uoT 

'5"g eiiequeo 
/AuxexotA'M'f 3 IƏSVIJ'I SUSISSIOQIPL er3?uoT 
erg eprerepvy/'oinoa('ueT) unToerneT uozpuəpeonəT 
*Z'N "org TIPI20/"19'Y esTeoxe PTIYTUY 
:9"d/*18'Y SNTTOJFIPT uobodosI 
gp ^5'g IS1NUSAPM/*J je: 28103 WNSUTIDOS WNTIYAIOQUZ 
PTO"N/UOsUYOrC *T(*TISNA* a) euerbur[aep etbutTzeq 
:9"8 Y9SOQUOISATA/*T WNTTOJFIPTTOIS unteqezg 
*5$'g eITEQUeD/*TTTQe] $neorzes SOYIUPUSPY 
Lad AVIOVALOSd 


zzz 


= = = = x= x= x= z 


EE 
LEE 


penunuo) `Z 3T8VL 


311 


Angiosperm Phylogeny Using Protein 


Martin & Dowd 
Sequences 


Volume 78, Number 2 


1991 


GAMUYUNYWYTIAIAOWTIGO 


GAMMWMAITAIIOYNW aw 
dIMXNMNMAITAS3IÓXNITIIG 
dAMXNxuuITZ4JGIÓXNI1ZJT1V 
IAMNNAITITACADATTIY 
dIM9DSETTACIÓNTAIY 
dIMXNNNITAGSIÓNMIGTO 
dIMXNNITTTASIMNMIGTO 
dIM5NUuUTTAIGIÓXITÓO 


AHH 
954 > > 
ann 
HAA 
SS 
EQ on) n) 
Ba Ps Pa 
x x x 
x< >x x 


AAA 
94 PA > 
nnn 
HAAA 
HBB 
a n n 
Ba Du [s 
x< M be 
x< x x 
>=> 


IN/9X N d M AT/OW `S`O/`3 "OOH STTtqertTw PTUISITMIOM 


N 


NAXddMAO 


NA dd M 
NX d NM 


QquNnda 


S f Id M 
SsI/dI/dM 


5 4 Id M 


000 
ËJ n) El 
H 
f As As 
= == 
> > > 


000 
BJ a Al 
H 
Du Ay Ay 
= = = 
>>> 


OOO a 


> 
a 


O x Se 


M'IM IVIDVIHISLIMIAM 
W "org epteTepy/bueyoyny SepToOqor so AdATH etonbesejey 


OXL IVIDWICOXWL 
W "MSN Áeg s,uwuejeg/uosuqoep*T sTUnuuoo eruezoioey 
a99 AVIDVAVIXO 
x: '5'd/*1 *qoTtq obxuro 
XN9 IVIDNODANIO 
W "9'8 epreTepy/'3seg sIITŠezz eapeudg 
Hda AJVAOVYAAH da 


SRHSdSONNRNAXS 


"erg eiiequej3 


W /'XooH xe `uung'y('Ig'u) unsouKə umtsetTdoucjtey 
W "5'd P*119qUeY/*18*Y sNTTosTAeT snydərgsng 

TWS GWAOVOWTINS 
W *S'9/*1 SIPÉTNA UNITITIL 
H `9'a/uossoS (`T) unəoertTTu unzeyjeqdtg 


‘W'S SUSIIOL IƏATNH 
W /lepneas XƏ ‘UTIL (*AeD) SrtSiieng seqtTwberyg 
H `S`O/ `T euuezed untrToT 
W *S`O/ `T ƏI@ŠTnA unəpIoH 
wod awaowod 

urqaweqg':9:5 'gei ees/ (*3*quotey) 
H unueTMOT*S X ASTpuTT wnpunqtzoTz X unrprquAS 
YO dV3oValHONHO 
*9'8 9ePIeTepv/*T wnt TAydodAy snosny 
*9'8 epPreTepy/*inoT eqeotds edot117 
"o'a SPTETSPY/SUNTE ior3e[e P1I5TPTdSY 


xxxx.-—_———_—_—_._—__—_——_—_____JJJ—— A nI-gZ,U—4—s——— aee 
— n |Ə— nÁs—————— 


'penunuo) *z 318V[ 


312 


angiosperm ancestors. All three members of Acan- 
thaceae that were tried failed with symptoms like 
these, as did four out of six species from Caesal- 
piniaceae. 

Finally, an entirely different sort of failure oc- 
curred with four species, all hemiparasites from 
the putatively related families Santalaceae and Lo- 
ranthaceae. These species had abnormally high 
amounts of phenolics, but it seems unlikely that 
failure can be attributed to them or to any of the 
other causes mentioned above. The preparations 
always yielded abnormally high amounts of plas- 
tocyanin but no trace of rubisco-SSU. Plastocy- 
anin, a chloroplast protein, has a molecular weight 
sufficiently close to rubisco-SSU that it occurs, 
occasionally, as a small contaminant detected dur- 
ing sequencing. It could be identified by its se- 
quence but, except in these two families, it was so 
weak that it disappeared after about seven posi- 
tions. The strength of the plastocyanin sequence 
in all four of these hemiparasites suggests that the 
absence of rubisco-SSU could not be ascribed to 
some general difficulty like proteases, but might 
reflect an unusual, perhaps facultative, photosyn- 
thetic system. 


GENERAL REMARKS ABOUT THE SEQUENCES

OVERALL VARIATION AND INVARIANT SITES


A summary of the variation that we have ob- 
served is given in Table 3; the amino acids most 
commonly observed are in the top line. 

The rubisco-SSU gene includes two introns, the
first of which is inserted before the codon that
determines amino acid 3. That codon determines
valine, which, like tryptophan at position 4, is
invariant; the two codons carry the signal to cut
the end of the intron (Berry-Lowe et al., 1982).
These invariant residues were useful early signals
that the correct protein fraction had been chosen.
Within
the first 40 amino acids, proline always occurs at 
position 5 and/or 6, at position 19 and/or 20, 
and at position 40. These three regions correspond 
to bends in the tertiary structure of the molecule. 
Chapman et al. (1988) have indicated that between 
the first and second bend there is alpha-helix and 
thereafter beta-sheet. There is an almost invariant 
region from amino acids 13 to 18, a region that 
makes contact with one of the large subunits (Chap- 
man et al., 1988; Knight et al., 1989). The only 
variation we have found in this region is the sub- 
stitution of phenylalanine for leucine at position 15 
in five species of Solanum (Martin et al., 1986). 
These same species also have phenylalanine
substituted for leucine at the almost invariant
position 21. The simultaneous occurrence of two
very rare
substitutions indicates a causal connection. Hydro- 
phobic bonding between the two positions may sta- 
bilize the bend at position 19 and, because these 
species are inhabitants of very hot and arid regions, 
this may have been a factor in natural selection. 
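
The invariant features described above can be expressed as a simple check on a 40-residue N-terminal sequence. The following is a minimal sketch, not part of the original analysis: the function name `check_ssu_nterminus` is hypothetical, and the example string is one of the 40-residue sequences legible in Table 2 of this copy.

```python
def check_ssu_nterminus(seq):
    """Report which features described in the text hold for a sequence.

    Positions are 1-based, as in the text: Val at 3, Trp at 4,
    Pro at 5 and/or 6, Pro at 19 and/or 20, and Pro at 40.
    """
    if len(seq) < 40:
        raise ValueError("need at least the first 40 residues")

    def at(pos):
        # Convert the 1-based position used in the text to a 0-based index.
        return seq[pos - 1]

    return {
        "val_3": at(3) == "V",
        "trp_4": at(4) == "W",
        "pro_5_or_6": "P" in (at(5), at(6)),
        "pro_19_or_20": "P" in (at(19), at(20)),
        "pro_40": at(40) == "P",
    }


# Example sequence as it appears in Table 2 of this copy (assumed to be
# transcribed correctly).
example = "MKVWPPLGLKKFETLSYLPTLSESQLAKEVDYLLSNKWVP"
report = check_ssu_nterminus(example)
print(report)
```

A check of this kind mirrors the use the authors describe for the invariant residues: an early signal that the correct protein fraction has been sequenced.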


HETEROGENEITY WITHIN SPECIES 


The first reports of rubisco-SSU sequences (Stro- 
baek et al., 1976) were for the N-terminal amino 
acids in species of Nicotiana, and these showed 
heterogeneity at positions 7 and 8 in tobacco. We 
have also found it at position 30. These hetero- 
geneities are undoubtedly associated with the am- 
phidiploid origin of tobacco and led to the expec- 
tation that heterogeneity would be fairly common, 
not only because about one-third of plant species 
are polyploid, but also because in diploids gene 
duplications are frequent. We may not have de- 
tected some heterogeneities (for example, those 
involving serine, which gives a weak signal), but 
we did detect 34 species with one heterogeneity, 
11 with two, and 4 with three. The demonstration 
by Pichersky et al. (1986) that in tomato there 
were at least three different DNA messages for 
rubisco-SSU, all with the same N-terminal amino 
acid sequence, suggests that selection acts strongly 
to preserve primary amino acid structure. There 
are at least eight different genes encoding rubisco- 
SSU in petunia (Lamb & Fitzmaurice, 1986); for 
this reason, when we prepared protein from that 
species, we used a mixture of equal quantities of 
leaves from four morphologically different varie- 
ties, with the aim of finding heterogeneities (Martin 
& Dowd, 1984b). The sequence was of high quality 
but no heterogeneity was detected. Likewise, we 
chose to study Rhoeo discolor because it is a 
complex interchange heterozygote for all chro- 
mosomes and might therefore be heterozygous for 
rubisco-SSU, but we detected no heterogeneity. 
Heterogeneities that were found presented no prob- 
lem for the computer analysis. 


INSERTIONS 


Only two examples of additional amino acids in 
the N-terminal sequence have been found. Both 
species of Epacridaceae that we studied had an 
additional isoleucine between normal positions 9 
and 10. Teucrium flavum (Lamiaceae) had two 
additional glycines, probably between the same two 
positions. These insertions, while clearly of taxo- 
nomic significance, have been ignored during data 
processing. 




THE N-TERMINUS 


Haslett et al. (1976) reported that the
N-terminus of rubisco-SSU was “frayed,” some
molecules seeming to have methionine in position 1
while others are without it. This is the situation 
that we have encountered in the vast majority of 
species, the effect being that at every position two 
amino acids are recorded, the correct one and the 
next one. Usually the two signals are approximately 
equal, especially when the protein is of highest 
quality. This property is helpful in that it provides 
a second opportunity for identification and is useful 
for identifying minor contaminating proteins whose 
residues appear only once, but probably means that 
it is more difficult to obtain long sequences because 
attenuation of the signal occurs earlier. 
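
The effect of fraying on the sequencing record can be illustrated with a small simulation. This is a sketch under simplifying assumptions (an equal mixture of full-length chains and chains lacking residue 1, and no signal attenuation); the function names are hypothetical and the short sequence is used only for illustration.

```python
def frayed_signals(seq, cycles):
    """Residues expected at each sequencing cycle from a frayed preparation.

    Each cycle yields a pair: the residue from full-length chains and the
    residue from chains that lack residue 1 (which run one position ahead).
    """
    if cycles >= len(seq):
        raise ValueError("cycles must be smaller than the sequence length")
    return [(seq[i], seq[i + 1]) for i in range(cycles)]


def is_self_consistent(signals):
    """True if every 'next' residue reappears as the main residue one cycle later.

    A residue from a minor contaminant appears only once and breaks this chain.
    """
    return all(signals[i][1] == signals[i + 1][0]
               for i in range(len(signals) - 1))


signals = frayed_signals("MKVWPPLGLK", 5)
print(signals)
print(is_self_consistent(signals))
```

The consistency test corresponds to the "second opportunity for identification" mentioned above: each residue should be seen twice, in successive cycles.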

All those species for which nucleic acid sequenc- 
es have been reported were also studied by us and 
all show fraying. Because the nucleic acid sequenc- 
es show the N-terminus to be methionine, there is 
no doubt. The signals we obtained were not typical 
for either methionine or its sulfone derivative. 
Whether the derived amino acid is obtained by 
dansylation (in manual sequencing) and identified 
by TLC, or is the PTH derivative from automatic 
sequencing and identified by HPLC or TLC, the 
N-terminal amino acid moves differently from me- 
thionine; therefore, we conclude that it is a modified 
form of that amino acid. 

Two exceptions to the above generalization have 
been encountered. In 10 out of 11 species of the 
Onagraceae the N-terminus is phenylalanine, the
only variation from methionine known, and in
these there is no sign of fraying. In six other species 
the N-terminal amino acid is methionine (and gives 
the normal signal for PTH-methionine), but there 
is no sign of fraying, the difference from the ma- 
jority of species being sharp and unmistakable; 
these are two members of Papaveraceae (Papaver 
orientalis and Eschscholtzia californica), two from 
Pedaliaceae (Sesamum indicum and Ceratotheca 
triloba), Vitex lucens (Verbenaceae), and Mentze- 
lia lindleyi (Loasaceae). Any hypothesis to explain 
fraying must account for these exceptions, and we 
believe that they exclude artifacts arising from 
techniques of protein production or sequencing. 
Any hypothesis must also account for the modifi- 
cation of methionine and the equality of the two 
forms of the protein. We therefore dismiss as un- 
likely hypotheses relying on inefficient shortening 
of the protein either as it passes through the chlo- 
roplast membrane or after entry. 

It is known that rubisco-SSU forms dimers (Roy 
et al., 1978), and we suggest that this may be
through formation of disulphide bridges between
the N-terminal methionines of two SSU molecules. 
This might occur in vivo if the enzyme model of 
Chapman et al. (1988) is correct, but is more likely 
an in vitro event if the different model of Knight 
et al. (1989) is correct. If dimers are formed, 
S-carboxymethylation at pH 8.6 would not break 
an inter-methionine bond, but we suggest that the 
dimer does fall apart so that dimethionine is on 
one chain and no methionine on the other. This 
hypothesis would account for all phenomena except 
for non-fraying species, which presumably do not 
naturally form dimers. 


METHODS OF DATA ANALYSIS 


Before computer analysis, amino acid sequences 
were converted to inferred nucleotide sequences 
using the genetic code. Usually this could be carried 
out after inspection of, for example, all the se- 
quences in a Group so that the most parsimonious 
choices of codons could be made. A standard was 
chosen at sites where substitution was silent. Al- 
though a program was available (Martin et al., 
1983), usually the path was obvious and computing 
unnecessary. Thus the unit of length in phyloge- 
netic trees is an inferred nucleotide difference 
(i.n.d.). 
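
The idea of an inferred nucleotide difference can be sketched as follows. This is a simplified illustration, not the authors' procedure: it takes two amino acids at a time and finds the smallest number of nucleotide changes between any of their codons in the standard genetic code, whereas the text describes choosing codons parsimoniously across all sequences in a Group. The function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each amino acid (one-letter symbol) to the list of its codons.
CODONS = {}
for bases, aa in zip(product(BASES, repeat=3), AMINO):
    CODONS.setdefault(aa, []).append("".join(bases))


def min_nucleotide_diff(aa1, aa2):
    """Fewest nucleotide changes that convert a codon for aa1 into one for aa2."""
    return min(
        sum(x != y for x, y in zip(c1, c2))
        for c1 in CODONS[aa1]
        for c2 in CODONS[aa2]
    )


def inferred_differences(seq1, seq2):
    """Sum of minimum nucleotide differences over aligned positions."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned and of equal length")
    return sum(min_nucleotide_diff(a, b) for a, b in zip(seq1, seq2))


# Leucine to phenylalanine (the substitution reported at position 15)
# needs one nucleotide change; methionine to phenylalanine needs two.
print(min_nucleotide_diff("L", "F"), min_nucleotide_diff("M", "F"))
```

Pairwise minima of this kind give a lower bound on tree length; the choice of a standard codon at silent sites, as described above, is not modelled here.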

The number of dichotomizing trees (phylogenetic
Steiner trees) connecting N taxa is
1 × 3 × 5 × ... × (2N-5). The principle of analysis is that
the length of every possible tree is calculated and 
the shortest tree is chosen as the most probable. 
This agrees with the parsimonious hypothesis that 
evolution has proceeded by the shortest route. 
However, because the total number of possible trees 
increases very rapidly, i.e., when increasing from 
N-1 to N taxa it increases (2N-5)-fold, it is not 
always possible to consider every tree. 
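
The growth in the number of trees can be made concrete with a short calculation. The sketch below simply evaluates the product given above; the function name is hypothetical.

```python
def unrooted_tree_count(n_taxa):
    """Number of dichotomizing trees connecting n_taxa: 1 x 3 x 5 x ... x (2N-5)."""
    if n_taxa < 3:
        raise ValueError("the formula applies to three or more taxa")
    count = 1
    # Adding the k-th taxon multiplies the number of trees by (2k - 5).
    for k in range(4, n_taxa + 1):
        count *= 2 * k - 5
    return count


for n in (4, 5, 10, 12, 17):
    print(n, unrooted_tree_count(n))
```

For 12 taxa the product is already 654,729,075, which illustrates why exhaustive evaluation of every tree quickly becomes impractical.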

Except during the final stages of this project, 
the program that we used was MINTREE, the 
"branch and bound" program of Hendy and Penny 
(1982). With a Vax 785 computer the usual limit 
for simultaneous analysis was 12 taxa. This limit 
could be extended to about 15 with a supercom- 
puter (Cyber 205), but the trouble and expense 
precluded useful work. Although MINTREE has 
now been superseded, its co-program ANALYZE 
is still used because it possesses efficient routines 
for obtaining ancestral sequences and internodal 
lengths. 
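
The principle of choosing the shortest tree can be illustrated for four taxa, where only three trees exist. The sketch below is not MINTREE or ANALYZE: it uses Fitch's parsimony counting, substituted here as a standard way to obtain the length of a given tree, and it enumerates the trees exhaustively rather than by branch and bound. The taxon names and sequences are invented for illustration.

```python
def fitch_length(tree, seqs):
    """Minimum number of substitutions on a tree (Fitch counting).

    tree: nested 2-tuples of taxon names, e.g. (("A", "B"), ("C", "D")).
    seqs: dict mapping taxon name to an aligned nucleotide string.
    """
    n_sites = len(next(iter(seqs.values())))

    def visit(node, site):
        # Return (possible ancestral states, substitutions so far) at one site.
        if isinstance(node, str):
            return {seqs[node][site]}, 0
        left_set, left_cost = visit(node[0], site)
        right_set, right_cost = visit(node[1], site)
        common = left_set & right_set
        if common:
            return common, left_cost + right_cost
        # No shared state: one substitution is required at this node.
        return left_set | right_set, left_cost + right_cost + 1

    return sum(visit(tree, site)[1] for site in range(n_sites))


# Hypothetical aligned sequences for four taxa.
seqs = {"A": "AAG", "B": "AAG", "C": "CTG", "D": "CTA"}
topologies = [
    (("A", "B"), ("C", "D")),
    (("A", "C"), ("B", "D")),
    (("A", "D"), ("B", "C")),
]
lengths = {t: fitch_length(t, seqs) for t in topologies}
best = min(lengths, key=lengths.get)
print(lengths)
print(best)
```

A branch-and-bound program reaches the same minimum without evaluating every tree, by abandoning any partial tree whose length already exceeds the best complete tree found so far.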

Most of the analyses have been carried out using 
HENNIG86 (Farris, 1988) and a personal com- 
puter (Microbyte 230). This system is about two 


Annals of the 


314 


Missouri Botanical Garden 


TABLE 3. Summary of N-terminal sequences of rubisco-SSU from [number illegible] families of seed plants. The number under an amino acid indicates in how many families it is found. For key to amino acid symbols see Table 2.

[Table body not recoverable from the rotated page.]


Volume 78, Number 2 
1991 


orders of magnitude faster than the above and has 
the additional advantage that an analysis can be 
left running for days or weeks. The principles of 
its algorithms have not been published, but the 
time of its release suggests a possible connection 
with a published letter by Johnson (1987). 
HENNIG86 offers a number of options, which comparison
with MINTREE using the same sets of data suggests are
reliable; in order of preference we have used implicit
enumeration (ie*); ie followed by bb (branch swapping);
and mhennig followed by bb. Beyond the number of taxa
that can be handled by MINTREE or the ie option of
HENNIG86, correct solutions cannot be guaranteed.

A further advantage of HENNIG86 is that it 
includes a program for successive weighting, which 
often reduces the result to one or very few trees. 
This usually eliminates the need to derive consensus 
trees, a process that we have found unsatisfactory 
(Martin & Dowd, 1989). Finally, HENNIG86 de- 
rives the ci (consistency index) (Carpenter, 1988) 
and ri (retention index) which we record with each 
figure of a tree. In a personal communication, 
Farris defines these as follows: if r and m denote, 
respectively, the smallest and greatest number of 
steps that a character can require on any tree, and 
s denotes the number of steps that character re- 
quires on a considered tree, then c.i. is r/s and 
r.i. is (m-s)/(m-r). 
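The two definitions can be written out directly for a single character. The sketch below uses the symbols r, m, and s as defined above. The whole-tree (ensemble) versions, which sum r, m, and s over characters before forming the ratios, are the usual convention and are included as an assumption, since the text does not say how the per-character values are combined into the figures reported with each tree. Function names are hypothetical.

```python
def consistency_index(r: int, s: int) -> float:
    """c.i. = r/s for one character (r = fewest possible steps, s = steps on this tree)."""
    return r / s


def retention_index(r: int, m: int, s: int) -> float:
    """r.i. = (m-s)/(m-r) for one character (m = most possible steps)."""
    if m == r:
        raise ValueError("r.i. is undefined when m equals r")
    return (m - s) / (m - r)


def ensemble_indices(characters):
    """Whole-tree indices from a list of (r, m, s) triples.

    Assumption: sums are taken over characters before dividing.
    """
    total_r = sum(r for r, _, _ in characters)
    total_m = sum(m for _, m, _ in characters)
    total_s = sum(s for _, _, s in characters)
    ci = total_r / total_s
    ri = (total_m - total_s) / (total_m - total_r)
    return ci, ri


if __name__ == "__main__":
    # Hypothetical characters given as (r, m, s).
    chars = [(1, 3, 2), (1, 2, 1), (2, 4, 3)]
    ci, ri = ensemble_indices(chars)
    print(round(100 * ci), round(100 * ri))
```

A character that fits the tree perfectly has s = r, giving c.i. = 1 and r.i. = 1; a character that fits as badly as possible has s = m, giving r.i. = 0.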

MINTREE uses data such that each of the four 
nucleotides is entered as 1, 2, 4, or 8, which allows 
the counting of ordinary differences and also of 
heterogeneities; no matter what variation occurs 
at a site, it can be recorded as a sum that is always 
different for different combinations. Provided there 
are no heterogeneities, HENNIG86 can use the 
same notation (using the nonadditive option); if 
there are heterogeneities, they must be recorded 
by inserting additional taxa. This is satisfactory if 
there is only one variable site within a taxon when 
only two taxa need to be recorded. Assumptions 
of linkage must be made, however, if there is more 
than one variable site but only two taxa are to be 
recorded. This problem becomes increasingly im- 
portant as an analysis progresses from using raw 
data to derived ancestral sequences for families 
and then Groups because, in these, heterogeneities 
may be numerous. We have therefore used alter- 
native strategies. The first is inserting additional 
taxa as just described and accepting the result if 
the different versions of a taxon cluster without 
interruption. 
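The property that makes the 1, 2, 4, 8 notation convenient is that every combination of nucleotides at a site has its own sum. A minimal sketch follows. Which nucleotide receives which value is not stated in the text, so the assignment below is an assumption, as are the function names.

```python
from itertools import combinations

# Assumed assignment; the text gives only the values 1, 2, 4, and 8.
NUCLEOTIDE_CODE = {"A": 1, "G": 2, "C": 4, "T": 8}


def encode_site(observed: str) -> int:
    """Sum the codes of the distinct nucleotides seen at one site of a taxon."""
    return sum(NUCLEOTIDE_CODE[base] for base in set(observed))


def decode_site(code: int) -> str:
    """Recover the nucleotides recorded in a site code."""
    return "".join(base for base, value in NUCLEOTIDE_CODE.items() if code & value)


if __name__ == "__main__":
    # Every non-empty combination of nucleotides maps to a different sum.
    sums = [
        encode_site("".join(combo))
        for size in range(1, 5)
        for combo in combinations("AGCT", size)
    ]
    print(sorted(sums) == list(range(1, 16)))
    print(encode_site("AG"), decode_site(3))
```

Because the four values are distinct powers of two, a heterogeneous site such as A and G is stored as a single number that can always be decomposed again.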

The second strategy is to use binary coding for
nucleotides; e.g., 1000 for A, 0100 for G, 1100
for A and G, 0010 for C, and so forth. In conjunction
with HENNIG86, MINTREE notation is slower than binary
notation, which is therefore advantageous. As would be
expected, binary notation gives a tree length about
double that obtained with MINTREE notation, but the
doubling is seldom exact. If inexact, the length is
always less than double, and there is a loose
relationship between the deficit and the number of
heterogeneities. Because the details of HENNIG86
have not been published, we have been cautious
about choosing between these alternatives and have
done all Group analyses with HENNIG86 using
both notations.
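A small example may help to show why binary coding roughly doubles the length and why heterogeneous sites could produce a deficit. In the sketch below, the code for T (0001) is an assumption standing in for "and so forth", the function names are hypothetical, and the comparison is a simple count of differing binary characters between two taxa rather than anything taken from HENNIG86.

```python
# 1000, 0100, and 0010 are from the text; 0001 for T is assumed.
BINARY_CODE = {"A": "1000", "G": "0100", "C": "0010", "T": "0001"}


def encode_binary(observed: str) -> str:
    """Four binary characters recording which nucleotides occur at a site."""
    bits = ["0", "0", "0", "0"]
    for base in observed:
        bits[BINARY_CODE[base].index("1")] = "1"
    return "".join(bits)


def binary_differences(site_a: str, site_b: str) -> int:
    """Number of binary characters that differ between two coded sites."""
    return sum(x != y for x, y in zip(encode_binary(site_a), encode_binary(site_b)))


if __name__ == "__main__":
    print(encode_binary("AG"))
    print(binary_differences("A", "G"))
    print(binary_differences("AG", "A"))
```

An ordinary substitution such as A to G alters two binary characters, whereas a heterogeneous site (A and G) differs from a homogeneous one (A) in only one. This is consistent with, though it does not prove, the observation that the binary length falls short of double when heterogeneities are present.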

We structured our investigation such that the 
majority of families were represented by at least 
two species from different genera. If we accept 
that taxonomy is seldom wrong when placing gen- 
era within families (Heywood, 1978), then we have 
an empirical way of judging the merits of the two 
notations. Omitting families that have either a sin- 
gle representative or are multiply represented, all 
Groups have been analyzed using both notations 
and, from the minimal trees recorded, we have 
chosen the best as judged by pairing of represen- 
tatives of families. In eight Groups both methods 
had the same best tree, in six binary gave the best, 
and in eleven MINTREE notation gave the best. 
In the section "Analyses Within Groups" we have
therefore used the taxonomically best minimal tree 
no matter which notation was used to derive it. 
However, in later sections we have used binary 
notation exclusively because it is quicker and more 
convenient. 

Among best trees, 79% of families showed correct
pairing of their members. When judging this
result, it should be remembered that a single mis- 
placed species will often result in the failure of 
pairing of representatives of two families. While a 
few such occurrences may be the result of incorrect 
taxonomy, the remainder are presumably caused 
by convergent evolution. The details can be seen 
in the figures for the Groups. 


ANALYSES WITHIN GROUPS 
EXPLANATION OF THE FIGURES 


The figures are drawn to scale, which is indicated 
by the length of one inferred nucleotide difference 
(i.n.d.). Only lengths have meaning, not angles. 
Usually at least two trees have been derived for 
each Group, one using sequences of individual spe- 
cies and the second using derived familial nodes. 
If the first shows congruent grouping of putative 
members of families, then the second is not needed. 




Disruption of familial grouping is often caused by 
the sole representative of another family or by a 
member of a multiply represented family. The sec- 
ond tree is derived from familial nodes and single- 
tons, is drawn with a different scale, and uses only 
the three-letter familial abbreviation of Weber 
(1982), which is given in Table 2. When appro- 
priate, trees of multiply represented families or 
genera are also given. In later analyses, Group 
nodes, and often one or two others, will be used; 
each of these is numbered. 


The Base of the Angiosperm Tree 


Before detailed analysis of Groups began, many 
analyses were done using the five gymnosperms, 
representatives of the monocotyledons and of 
Groups 1, 2, and 3 which were most likely, on 
taxonomic grounds, to be near the root of the 
dicotyledon tree. The angiosperm family closest to 
the gymnosperms was Schisandraceae. Figure 1 
shows the junction of the gymnosperms, Schisan- 
draceae, and the other angiosperms. The derived 
sequence of this node has been used as “Base” in 
all subsequent Group analyses. 

It will be noted that Figure 1 is different from,
and taxonomically more satisfactory than, the 
equivalent figure of Martin and Dowd (1989). Since 
then a sequence of Welwitschia has been obtained 
and this paired with Ephedra between the angio- 
sperms and the other gymnosperms. Three at- 
tempts to study Gnetum were made but all failed 
with symptoms suggesting strong leaf protease ac- 
tivity. 


FIGURE 1. Five gymnosperms analyzed with familial nodes of five angiosperm families from Groups 1, 2, and 3. The ancestral sequence derived for the junction of Schisandraceae has been used as an outgroup for analyzing the Groups of dicotyledons.


Group 1. An attempt to study Hedycarya 
(Monimiaceae) having failed, Peumus was left as 
a singleton, which was therefore omitted from Fig- 
ure 2a but included in Figure 2b. Correct pairing 
and grouping occurs in all families except Aristo- 
lochiaceae for which a derived familial node is 
shown in Figure 2b. As indicated above, Schisan- 
draceae is nearest to Base with a rather long gap 
to the remainder. 


Group 2. In contrast to the straightforward- 
ness of the previous Group, Group 2 has presented 
problems that arise from the great intrafamilial 
variation of Ranunculaceae and Papaveraceae, the 
trees for which are shown in Figure 3b and c. The 
two derived familial nodes are shown with the rest 
of the Group in Figure 3a. We interpret Ranun- 
culaceae and Papaveraceae to be ancient angio- 
sperm families, and only with some misgivings have 
we adhered to our acceptance of current taxonomy 
at the levels of family and below. This is especially 
so for Papaveraceae, but splitting off Fumariaceae 
creates more problems than it solves. 


FIGURE 2. (A) Group 1, omitting the single representative of Monimiaceae. (B) Tree of family nodes for Group 1.


FIGURE 3. (A) Group 2, with family nodes, derived from (B) and (C) for Ranunculaceae and Papaveraceae s.l.

FIGURE 4. Group 3. (Ci 100, Ri 100.)

FIGURE 5. The monocotyledons that have been studied, with some families represented by their nodes. (Ci 88, Ri 80.)


Group 3. Failure with a species of Cabomba 
has left Brasenia as a singleton that does not 
separate from the three members of Nymphae- 
aceae; however, the internode is so short that we 
draw no significance from it (Fig. 4). 


Piperaceae, Nelumbonaceae, and the mono- 
cotyledons. Piperaceae and Nelumbonaceae have 
not been placed in a Group because in both cases 
opinions differ among the four phylogenies consid- 
ered when we nominated Groups. All place them 
in either our Group 3 or Group 1, so it is appro- 
priate to carry out a joint analysis with the members 
of these two Groups, and at the same time to 
consider the links with the monocotyledons. Al- 
though no species have been added to those re- 
ported earlier (Martin & Dowd, 1989), the se- 
quences have been reanalyzed with HENNIG86. 
To reduce the number of taxa to be compatible 
with the ie option, familial nodes have been used 
for Araceae, Arecaceae, Commelinaceae, Poaceae, 
and Smilacaceae (Fig. 5). The monocotyledon node 
has been derived and is included in Figure 6. The 
result is different from that of Martin and Dowd 
(1989), and this is presumably because of the new 
computing program. 


FIGURE 6. The base of the angiosperm tree showing the relationship of the monocotyledons to members of Groups 1, 2, and 3 and Nelumbo and Piperaceae.

FIGURE 7. Group 4.

FIGURE 8. (A) Group 5. The node for Nothofagus was derived from (B).

FIGURE 9. Group 6. (Ci 66, Ri 64.)
Group 4. There is good pairing between members
of Ulmaceae and Urticaceae but not Moraceae
(Fig. 7). It was only after failures with two species
of Ficus and one of Maclura, almost certainly due
to protease activity, that we supplemented with
Humulus, knowing that its taxonomic position was
not entirely clear. The failure of Humulus and
Morus to pair was therefore not surprising.
Subsequently, Humulus was removed from Group 4
and added to the list of uncertain taxa (see below).


Group 5. The tree of Nothofagus (Fig. 8b) is
slightly different from that of Martin and Dowd
(1988) because it is influenced by the weighting
procedure of HENNIG86 and because it includes
Fagus, which does not separate. From this tree a
node has been derived and used in Figure 8a. While
Betulaceae, Casuarinaceae, and Hamamelidaceae
have correct grouping, the junction with Base
divides Nothofagus from Quercus.


Group 6. Figure 9 differs from one already
published by Martin and Dowd (1989); the two
trees are of the same length but this one is preferred
because it shows perfect pairing and reflects the
order when only familial nodes are used. However,
the other probably conforms better with taxonomy
in that Dilleniaceae is separate from the other three
families.


Group 7. Figure 10 shows that the two
representatives of Juglandaceae pair leaving Myrica,
the sole representative of Myricaceae, separate.

FIGURE 10. Group 7. (Ci 91, Ri 93.)


FIGURE 11. (A) Group 8. (B) Group 8 family nodes.


Group 8. The “Centrospermae” is one of the 
most unsatisfactory groups with representatives of 
two families, Chenopodiaceae and Amaranthaceae, 
failing to form pairs (Figure 11a). Spinacia and
Beta have identical sequences, but these are quite
different from Chenopodium. The tree for family
nodes is in Figure 11b.


Group 9. There is a marked difference be- 
tween the two representatives of Tiliaceae; Grewia 
is at the bottom of the tree (Fig. 12), while Spar- 
mannia disrupts the clustering in Malvaceae. How- 
ever, the remaining four families are satisfactory. 
Grewia was removed from Group 9 and added to 
the list of uncertain taxa. (As will be mentioned 
later, it subsequently rejoined.) 


Group 10. While Violaceae, Cucurbitaceae,
Salicaceae, Brassicaceae, and Flacourtiaceae
formed good clusters (Fig. 13a), the two
representatives of Datiscaceae (Datisca and Tetrameles)
were very different. Attempts to study Capparis
having failed, Cleome was left unpaired so we chose
Reseda from the putatively related family
Resedaceae. Since these two did group, we did not seek
correct partners for them. In addition to these two
singletons, Moringa represents a monogeneric
family. A tree from family nodes is shown in Figure
13b.


FIGURE 12. Group 9.


Group 11. The representatives of all four
families form pairs (Fig. 14), Myrsinaceae adjacently
and the other three families dichotomously.


Group 12. As mentioned earlier, the two species
of Epacridaceae are distinguished by having an
additional amino acid inserted in their sequences.
Although this could have been used as a character,
it was unnecessary because the two species paired
separately from the two Ericaceae species (Fig. 15).


Group 13. When family nodes are derived for
Rosaceae, Cunoniaceae, and Saxifragaceae, they
are very close (Fig. 16b), so it is not surprising
that there is confusion when individual species are
analyzed (Fig. 16a). The representatives of
Rosaceae pair correctly, however.


Group 14. Among minimal trees derived when
all legume species are analyzed simultaneously,
there are some in which the two Mimosaceae
species pair and so do the two Caesalpiniaceae;
however, the eight Papilionaceae species are confused.
We have therefore derived a Papilionaceae node
separately (Fig. 17b) and show this with the other
two families (Fig. 17a).


FIGURE 13. (A) Group 10, omitting single representatives of families. (B) Family nodes of Group 10. Because they did not pair and are sometimes placed in different families, Datisca and Tetrameles are included here.

FIGURE 14. Group 11. (Ci 94, Ri 92.)

FIGURE 15. Group 12.


FIGURE 16. (A) Group 13. (B) Family nodes of Group 13.

FIGURE 17. (A) Group 14 with Papilionaceae represented by a node derived from (B).

FIGURE 18. (A) Group 15 omitting Trapa and Punica, which are included with family nodes in (B). (C) Onagraceae.


Group 15. This Group, which corresponds to
the order Myrtales, was discussed by Martin and
Dowd (19862). Since then only Trapa and Punica,
both singletons, have been added. When the
representatives of the other five families are analyzed,
pairing is good except in Lythraceae (Fig. 18a).
The three members of Onagraceae in this tree are
from the bottom of the family tree (Fig. 18c). When
family nodes are analyzed (Fig. 18b), the root of
the tree is in a different place from the one
previously published; it is uncertain why this is so, but
it could be due to the inclusion of new families,
the earlier choice of inappropriate outgroups, or
the new analytical methods.


Group 16. As discussed earlier, all
representatives of the hemiparasites of the Santalales and
Loranthaceae failed to yield protein samples, so
this Group is reduced to Olacaceae and
Celastraceae in which pairings are straightforward (Fig.
19).


Group 17. This Group is not very satisfactory
possibly because, as indicated in Figure 20b, there
has been a rapid radiation. The consequences are
that the members of Simaroubaceae and
Sapindaceae do not pair, while Flindersia, sometimes
excluded from Rutaceae, does not group with the
other two representatives of that family. However,
there is good pairing for Connaraceae and
Anacardiaceae (Fig. 20a). Melia having failed, Cedrela
is left as the sole representative of Meliaceae.


Group 18. The two members of
Haloragaceae, Gonocarpus and Haloragodendron, are so
confounded with the three members of
Rhizophoraceae (Fig. 21) that there was no point in deriving
family nodes to derive a Group node.


Group 19. Nitraria having failed,
Zygophyllum is a singleton as is Tropaeolum, for which
no partner was available. As shown in Figure 22a,
the members of Geraniaceae and Malpighiaceae
pair. The family node tree is shown in Figure 22b.


FIGURE 19. Group 16.


FIGURE 20. (A) Group 17 omitting Melia, which is included with family nodes in (B).


FIGURE 21. Group 18. (Ci 97, Ri 95.)


Group 20. Hoya was left a singleton by failure 
to extract protein from two other members of As- 
clepiadaceae, Asclepias and Cryptostegia. The 
members of four families showed dichotomous pair- 
ing while Logania paired alongside Buddleia, which 
is only possibly a member of Loganiaceae (Fig. 23). 


Group 21. The representatives of Lamiaceae
and Verbenaceae were very similar but there was
nevertheless a minimal tree in which congruent
pairing occurred (Fig. 24).


FIGURE 22. (A) Group 19 omitting Zygophyllum and Tropaeolum, which are included in (B).


Group 22. There have been previous reports
of Solanum (Martin et al., 1986) and Nicotiana
(Martin & Dowd, 1984b). Using HENNIG86, new
nodes have been derived for both genera (Fig. 25b,
c) and were used, with Anthocercis, to represent
Solanaceae in Figure 25a. These group well but
there is confusion between the representatives of
Convolvulaceae and Polemoniaceae.


Group 23. As mentioned earlier, all attempts
to extract rubisco from Acanthaceae species failed.
The representatives of the other four families of
this Group pair well, Scrophulariaceae,
Gesneriaceae, and Bignoniaceae dichotomously and
Pedaliaceae adjacently (Fig. 26).


FIGURE 23. Group 20. (Ci 75, Ri 82.)

FIGURE 24. Group 21.

FIGURE 25. (A) Group 22 with the Solanum node derived from (B) and the Nicotiana node from (C).


Group 24. The three members of
Caprifoliaceae are substantially different from the two
members of Valerianaceae so that correct grouping is
observed (Fig. 27).


Group 25. This Group is unusual in that Apium
and Foeniculum of Apiaceae have identical
sequences as do Schefflera and Fatsia of
Araliaceae. Consequently, the tree of this Group (Fig.
28) is very simple.


FIGURE 26. Group 23. (Ci 89, Ri 84.)

FIGURE 27. Group 24.

FIGURE 28. Group 25.


THE DERIVATION OF A TREE FOR THE 
GROUPS OF DICOTYLEDONS 


FIRST STAGE; A TEST OF THE 
REALITY OF GROUPS 


Depending on the size and complexity of the 
Group, one, two, or three nodes have been marked 
near the bases of each Group tree; altogether there 
are 58 basal nodes and the ancestral sequence of 
each has been derived using ANALYZE. These 
have been used for a test of the reality or integrity 
of the Groups. If a family does not really belong 
to a Group, it should usually behave like an out- 
group and assume the position closest to the base 
of the tree. Thus, in a simultaneous analysis of all 
58 basal nodes, it would be expected that nodes 
truly belonging to the same Group should cluster 
together. If a family is misplaced in a Group, the 
nodes should separate. 

The only program that can be used with 58 taxa 
simultaneously is HENNIG86 with the option 
mhennig followed by bb. This was done three times, 
each yielding large numbers of trees for which strict 
consensus trees were derived. Inspection indicated 
that most Groups behaved as if they were real, but 
some separation of nodes occurred in Groups 5, 
8, 14, 15, 22, and 24 (see below). It is unlikely 
that this sort of analysis would give a completely 
reliable result, but our interpretation is that where 
there is no separation of within-Group nodes, that 
Group should be accepted as valid. We understand 
that our test is not infallible, but we are reluctant, 
at this stage, to attempt another obvious test, viz. 
the simultaneous analysis of adjoining Groups. This 
test was used earlier with Groups 1, 2, and 3 and 
led to considerable mixing of the first two. The 
amount of convergent evolution between Groups 
is probably such that, if this test were applied 
widely, confusion would result. Therefore, even 
though we understand the limitations, we confine 
our testing of the integrity of Groups to one sort 
of analysis. 
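The clustering criterion described above can be stated as a simple check: a Group passes if all of its basal nodes occupy one branch of the tree to the exclusion of everything else. The sketch below is only an illustration of that criterion and is not how the consensus trees were actually inspected. The nested-tuple tree, the node labels, and the function names are hypothetical, and the tree is assumed to be rooted at Base.

```python
def leaf_sets(tree):
    """Return (leaves of this subtree, leaf sets of every clade within it)."""
    if isinstance(tree, str):
        single = frozenset([tree])
        return single, [single]
    all_leaves = frozenset()
    clades = []
    for child in tree:
        child_leaves, child_clades = leaf_sets(child)
        all_leaves |= child_leaves
        clades.extend(child_clades)
    clades.append(all_leaves)
    return all_leaves, clades


def groups_that_separate(tree, group_of):
    """List the Groups whose nodes do not form a single cluster in the tree.

    group_of -- dict mapping each leaf label to its Group identifier
    """
    _, clades = leaf_sets(tree)
    clade_set = set(clades)
    members = {}
    for node, group in group_of.items():
        members.setdefault(group, set()).add(node)
    return sorted(
        group for group, nodes in members.items()
        if frozenset(nodes) not in clade_set
    )


if __name__ == "__main__":
    # Hypothetical tree: Group 1 clusters, Groups 5 and 8 are interleaved.
    tree = ("Base", (("1a", "1b"), (("5a", "8a"), ("5b", "8b"))))
    group_of = {"Base": "Base", "1a": "1", "1b": "1",
                "5a": "5", "5b": "5", "8a": "8", "8b": "8"}
    print(groups_that_separate(tree, group_of))
```

A strict check of this kind is harsher than visual inspection of a consensus tree, since a single intruding node is enough to make a Group fail.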

For the six Groups where there was doubt, we 
applied the test devised by Lake (1987). This is 
confined to four species, A, B, C, D and uses a 
chi-square test to decide which is the most probable 
of the three possible relationships, viz. A + B &
C + D or A + C & B + D or A + D & B + C.
Thus representatives of each part of a divided 
Group were tested with representatives of the Groups 
with which they most closely clustered. These tests 
gave no further grounds for doubting the integrity 
of Groups 8 and 15 and consequently, in the next
stage of the analysis, they were included unchanged.
The tests reinforced the doubts about
Groups 14, 22, and 24, so their separate parts 
were added to the list of uncertain families to be 
incorporated later. These were: from Group 14, 
Mimosaceae plus Papilionaceae on the one hand 
and Caesalpiniaceae on the other; from Group 22, 
Convolvulaceae plus Polemoniaceae on the one 
hand and Solanaceae on the other; from Group 
24, Valerianaceae and Caprifoliaceae. Tests with 
Group 5 were equivocal, so Hamamelidaceae was 
removed and added to the list of uncertain families, 
but the node for the remaining three families was 
used at the next stage. 
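For four sequences there are exactly three ways of pairing them, and each variable site can favor at most one. The sketch below only counts the sites that favor each pairing under simple parsimony. It is not Lake's method, which works with transversion-based invariants and a chi-square test, and it is offered solely to make the three alternatives concrete. The function name and the toy sequences are hypothetical.

```python
def quartet_support(a: str, b: str, c: str, d: str) -> dict:
    """Count sites supporting each of the three pairings of four aligned sequences.

    A site supports a pairing when the two members of each pair agree
    with one another but the two pairs differ.
    """
    support = {"A+B & C+D": 0, "A+C & B+D": 0, "A+D & B+C": 0}
    for w, x, y, z in zip(a, b, c, d):
        if w == x and y == z and w != y:
            support["A+B & C+D"] += 1
        elif w == y and x == z and w != x:
            support["A+C & B+D"] += 1
        elif w == z and x == y and w != x:
            support["A+D & B+C"] += 1
    return support


if __name__ == "__main__":
    # Hypothetical five-site sequences.
    print(quartet_support("AAGCT", "AAGTT", "CAGCA", "CAGTA"))
```

Sites at which all four sequences agree, or at which only one sequence differs, favor none of the three pairings and are ignored.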


SECOND STAGE; DERIVING A PRELIMINARY, 
ABBREVIATED TREE 


Following the first stage, the basal node was 
used to represent each of the 22 remaining Groups 
(though amended in Group 5 after removal of Ham- 
amelidaceae). Several analyses, using mhennig and 
bb, were carried out on these nodes. The object 
was to identify apparently constant associations 
from which nodes might be derived in order to 
reduce the number of taxa to 16, a number com- 
patible with analyses using the reliable ie program 
at the next stage. The following five pairings were 
chosen and their nodes derived: Groups 2 and 3; 
Groups 6 and 16; Groups 7 and 9; Groups 11 and 
19; Groups 21 and 23. Group 25 was omitted at 
this stage because it was small, well-defined by 
morphology and our own work and could be in- 
corporated later in the same way as the uncertain 
taxa. The resulting tree of 16 taxa is shown in 
Figure 29. 
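The reduction to 16 taxa can be checked by simple bookkeeping. The sketch below assumes that the 22 remaining Groups are the original 25 less Groups 14, 22, and 24, which were set aside after the first stage; the variable names are hypothetical.

```python
# Groups still intact after the first stage (assumed: 14, 22, and 24 set aside).
remaining = set(range(1, 26)) - {14, 22, 24}

# Pairs of Groups each replaced by a single derived node.
pairings = [(2, 3), (6, 16), (7, 9), (11, 19), (21, 23)]

# Group 25 was held back for incorporation later.
omitted = {25}

taxa = {str(group) for group in remaining - omitted}
for first, second in pairings:
    taxa -= {str(first), str(second)}
    taxa.add(f"{first} & {second}")

print(len(remaining), len(taxa))
```

Twenty-two Groups less one omitted leaves 21; merging ten of them into five pairs removes a further five, giving the 16 taxa analyzed with the ie option.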


THIRD STAGE; INCORPORATING TAXA OF UNCERTAIN 
AFFINITIES 


At this point there were 28 taxa on the list of 
those with uncertain affinities, comprising 18 fam- 
ilies that were not placed in a Group (see Table 1 
but note that Piperaceae and Nelumbonaceae were 
considered earlier), two genera (Humulus and 
Grewia) excluded from Groups during their anal- 
ysis, five families and two pairs of families excluded 
during the first stage, and Group 25 omitted at the 
second stage. We wanted to add these into the 
second stage tree as accurately as possible using 
the ie program. These analyses with 17 taxa could 
each be performed in about a day. Although there 
was some variation, the second stage tree remained 
reasonably stable during these analyses, and we 
noted where each uncertain taxon fit. Six joined 




in the basal third, 14 in the middle third, and 16 
in the distal third. (The nonadditivity reflects that 
rigid demarcation was not exercised and borderline 
taxa were placed in two sets.) The members of each 
of the three sets were then analyzed with the cor- 
responding members of the second stage tree and 
possible new or amended Groups were identified. 


FOURTH STAGE; REDEFINITION OF GROUPS 


Putative new or amended Groups were tested 
extensively to ensure that they were real. In this 
process an important factor in determining the 
coherence of Groups was the length of the inter- 
node joining a hitherto uncertain member to the 
Group. Penny et al. (1987) have emphasized that 
“long edges attract," and we have long been aware 
that the junction of a distantly connected taxon is 
subject to so much variation that it is scarcely 
reliable. Thus, we have usually rejected a potential 
new member of a Group if it joins with a disproportionately
long internode and have left it as uncertain.

FIGURE 29. The provisional tree of Group nodes abbreviated by combining some Groups and omitting others that had dissolved. (Ci 91, Ri 91.)

In three cases, definite hypotheses arising from
our work could be tested. In each case the question
was whether a taxon belonged to the Group to
which it was initially assigned (Table 1) or to the
Group indicated by stage 3. This could be answered
by considering the lengths of the alternative trees.
In two cases, Humulus and Hamamelidaceae, the
new grouping was shorter and therefore preferred.
For Grewia, the trees were the same length so
there was no good reason for preferring the new
grouping (with Group 18).

As a consequence of these tests, only 15 of the
original 25 Groups have the same composition as
they had before the first stage of this section. The
other ten Groups have been increased, decreased,
or merged. Where a nucleus of an original Group
remains, the number has been retained but A added.
Original Groups 13, 24, and 25 have disappeared.
New Groups 26, 27, 28, and 29 have been formed.

Group 4A. Humulus has been removed.


Volume 78, Number 2 
1991 


Group 5A. Hamamelidaceae has been removed.


Group 8A. Although the original Group 8
("Centrospermae") remains intact, Lecythidaceae
and Humulus join the same branch of the tree (Fig.
30).


FIGURE 30. Group 8A. See Figure 11 for Group 8.


Martin & Dowd 
Angiosperm Phylogeny Using Protein 
Sequences 




Group 12A. At all stages, Convolvulaceae and
Polemoniaceae grouped separately from Solanaceae,
the other member of Group 22. At the third
stage, along with Polygonaceae, they clustered with
Group 12 (Fig. 31). Polemoniaceae and Ericaceae
are confused, but the other three families grouped
appropriately.


Group 14A. The second stage tests suggested 
that the legumes should be divided between Caesal- 
piniaceae on the one hand and Mimosaceae and 
Papilionaceae on the other; Caesalpiniaceae clus- 
tered with Group 13. A series of Lake tests (see 
“First stage” above and Martin & Dowd, 1990) 
was therefore performed. These tests strongly in- 
dicated, first, that Caesalpiniaceae was closer to 
Rosaceae than to either of the other two legume 
families and, second, that Mimosaceae and Papilio- 
naceae were closer to other Groups (e.g., Connar- 
aceae in Group 17, Chrysobalanaceae in Group 
18A below) than were Caesalpiniaceae and Rosa- 
ceae. 

Other second stage tests had indicated that Proteaceae,
Coriaria, Crossosoma, and Hamamelidaceae
were also linked to the complex of Groups
13 and 14. The tree that resulted when these were
FIGURE 31. Group 12A. (Ci 81, Ri 85.)




Annals of the 
Missouri Botanical Garden 


FIGURE 32. (A) Group 14A. For three legume families
see Figure 17, and for Proteaceae see (B).


all analyzed together is shown in Figure 32a. The
Proteaceae node was derived from Figure 32b,
while the Mimosaceae-Papilionaceae node is node
3 of Figure 17a.


Group 18A. Third stage tests suggested that
the Chrysobalanaceae and Vitaceae might cluster
with Group 18 and also with Group 25 (Apiaceae
and Araliaceae). Incorporation of these (Fig. 33)
does nothing to repair the previous (Fig. 21) disjunction
of Haloragaceae while Rhizophoraceae s.l.
remain apart from Anisophyllea.


Group 22A. With the other two families joining
Group 12A (above), the Solanaceae were left
as the sole representative.


Group 26. This new Group (Fig. 34a) consists
of three families (Campanulaceae, Caprifoliaceae,
and Goodeniaceae), each with well-paired representatives.
With them is Asteraceae, the node for
which is derived from Figure 34b.


Group 27. This comprises the families Elaeagnaceae
and Rhamnaceae, the members of which
form pairs (Fig. 35).


Group 28. As noted below, Buxus does not 
pair with Simmondsia, which is sometimes placed 
in Buxaceae. While the latter clusters with Eu- 
phorbiaceae (Fig. 36), Buxus does not. 


Group 29. The species of Hydrophyllaceae,
Thymelaeaceae, and Valerianaceae form pairs in
this new Group (Fig. 37).


TAXA THAT REMAIN UNPLACED 


There are three families for which we have no 
acceptable hypothesis. (a) Loasaceae. It was un- 


FIGURE 33. Group 18A. (Ci 89, Ri 85.)


FIGURE 34. (A) Group 26 with node for Asteraceae from (B).
FIGURE 35. Group 27.

FIGURE 36. Group 28.




FIGURE 37. Group 29.

fortunate that we failed to obtain a sequence for
Eucnide bartonioides because this left Mentzelia
as a singleton and therefore with a “long edge”
that joined unreliably. (b) Plumbaginaceae. Although
the two representatives, Limonium and
Plumbago, paired well, there remained a very long
internode joining the family to the tree, and so we
have left it unplaced. (c) Buxaceae. Originally both
Buxus and Simmondsia were chosen as representatives
of Buxaceae (s.l.), but they proved quite
different and, since there was taxonomic opinion
to support this, they were treated separately. Whereas


FIGURE 38. The overall tree for the dicotyledons. Groups are
numbered and their constituent families indicated using the
three-letter acronyms of Weber (1982), given in Table 2. Families
in which nitrogen-fixation is known are underlined. (Ci 91, Ri 87.)


FIGURE 39. Groups are arranged along the X axis in the order that
they depart from the trunk of the overall tree (Fig. 38). Solid
dots are the mean distances of species of that Group from the
angiosperm origin, and bars indicate the range from smallest to
greatest. (Y axis: distance from base in i.n.d., with the mean of
Group means marked. Order of Groups along the X axis: 1, 2, 3,
22A, 8A, 15, 5A, 11, 17, 19, 12A, 20, 23, 4, 10, 6, 16, 28, 14A,
9, 27, 7, 18A, 29, 26, 21.)


Simmondsia grouped reasonably well with Eu- 
phorbiaceae, Buxus did not and remains unplaced. 


FIFTH STAGE; THE SIMULTANEOUS 
ANALYSIS OF REVISED GROUPS 


Initially, the nodes of all 26 revised Groups were 
analyzed using the option mhennig followed by bb, 
and the resulting tree was divided into a top, middle, 
and bottom section. Thus, with overlaps, each con- 
tained 14 taxa, a number that could be analyzed 
using the ie option. Fortunately, there was no con- 
fusion at the overlaps, and the three parts were 
fitted together to give the overall tree (Fig. 38). 
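The division into overlapping sections can be pictured with a short sketch. It assumes, purely for illustration, that the 26 Group nodes are taken in the order in which they leave the trunk (the order given for Figure 39) and that the three sections are evenly spaced windows of 14 nodes; the text does not state where the cuts were actually made. The helper `split_with_overlap` is hypothetical and is not part of HENNIG86.

```python
def split_with_overlap(items, n_sections, size):
    """Cut an ordered list into n_sections windows of `size` items,
    spaced evenly so that neighbouring windows share some items."""
    if n_sections < 2 or size > len(items):
        raise ValueError("need at least two sections no longer than the list")
    # Evenly spaced start positions running from 0 to len(items) - size.
    span = len(items) - size
    starts = [round(i * span / (n_sections - 1)) for i in range(n_sections)]
    return [items[s:s + size] for s in starts]


# Group nodes in the order they depart from the trunk (as listed for Figure 39).
GROUPS = ["1", "2", "3", "22A", "8A", "15", "5A", "11", "17", "19", "12A",
          "20", "23", "4", "10", "6", "16", "28", "14A", "9", "27", "7",
          "18A", "29", "26", "21"]

# The labels bottom/middle/top are an assumed reading of the three sections.
bottom, middle, top = split_with_overlap(GROUPS, n_sections=3, size=14)

# Nodes shared by neighbouring sections; agreement of their placement in
# both analyses is what allows the partial trees to be fitted together.
overlap_bottom_middle = [g for g in bottom if g in middle]
overlap_middle_top = [g for g in middle if g in top]
```

Under these assumptions each section holds 14 nodes and shares eight with its neighbour, so every section stays within the size that the exhaustive option can handle.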


DISCUSSION 


THE RATE OF EVOLUTION AND THE 
AGE OF THE ANGIOSPERMS 


In Figure 38, which shows the overall tree for 
the dicotyledons, there is a “trunk” from which 
branches depart at irregular intervals of up to 5 
i.n.d. In Figure 39, we arranged Groups in the 
order that they branch from the trunk. For every 
species we measured the number of differences (in 
i.n.d.) between it and the base of the angiosperm 
tree (Fig. 1). For each Group we show the mean 
of these distances and also the range from smallest 
to greatest. The mean of all Groups is 16.2 i.n.d. 
We have also analyzed variance and shown that 
there is significant (P < 0.001) variation between
Groups. Thus, although the difference between a 
slowly evolving Group such as Group 3 (mean 14.1) 
and a rapidly evolving Group such as Group 21 
(19.7) is not great, it is probably real. 

The age of the dicotyledons can be derived from
the product of the mean number of differences of
species from base and the rate of evolution (expressed
as time per i.n.d.). Since
Figure 6 suggests that the monocotyledons are
derived from the dicotyledons, this is also the age
of the angiosperms. Martin and Dowd (1988) estimated
the rate to be 1 i.n.d. in 14 Ma for a single
evolutionary line. However, this estimate was based
on members of the Fagaceae, Proteaceae, Solanaceae,
and Winteraceae, all of which belong to
Groups that evolve more slowly than average; their
mean number of differences from base is 14.7 i.n.d.
Thus, the inferred age of the angiosperms is 14 ×
14.7 ≈ 205 Ma, that is, at the beginning of the
Jurassic. Crane et al. (1989) and Wolfe et al.
(1989) have estimated the age of the angiosperms
as 200 Ma. If the monocotyledons are indeed derived
from the dicotyledons, there is good agreement.
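The arithmetic behind this estimate is simple enough to restate as a minimal sketch. The constant and function names are ours, chosen for illustration; the two input values are those quoted in the text.

```python
# Values quoted in the text; the names are illustrative only.
MA_PER_IND = 14.0            # 1 i.n.d. per 14 Ma along a single line (Martin & Dowd, 1988)
CALIBRATION_MEAN_IND = 14.7  # mean distance from base of the calibrating families


def inferred_age_ma(mean_differences, ma_per_ind=MA_PER_IND):
    """Age in Ma = mean number of differences from base x time per difference."""
    return mean_differences * ma_per_ind


age_ma = inferred_age_ma(CALIBRATION_MEAN_IND)

# Difference from the 200 Ma figure cited from Crane et al. and Wolfe et al.
difference_from_published = age_ma - 200.0
```

The product comes to a little under 206 Ma, within a few percent of the 200 Ma cited from the other two studies.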


THE RELIABILITY OF OUR TREES 


The current limitations of computers and com- 
puting programs make it impossible to conduct a 
large phylogenetic analysis in a completely objec- 
tive manner. Our first important deviation from 
objectivity has been accepting taxonomic opinion 
that species belong to the same family. Our second 
has been seeking a consensus in placing these into 
Groups. 

The assumption of correct assignment to families 
is strongly supported by the correct pairing (or 
formation of clusters of three when appropriate) 
shown in the objectively derived figures of the final 
26 Groups. Of the 95 families with two or three 
representatives, only 11 had disjunct representa- 
tives and, of these, at least four were families sensu 
lato with taxonomic opinions that they should really 
be split. These are the separation of Humulus from 
Morus in Group 4, of Flindersia from other Ru- 
taceae (Group 17), of Buxus from Simmondsia 




(Group 28), and of Anisophyllea from other Rhi- 
zophoraceae (Group 18). When it is further con- 
sidered that one aberrant species can disrupt two 
families, we submit that the high proportion of 
correct grouping is strong evidence not only for 
the correctness of other taxonomy at this level but 
also of the soundness of our approach. 
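The counts given above can be turned into proportions with a small sketch. Setting aside the four sensu lato families as not being genuine mispairings is an assumption made here for illustration; the text says only that "at least four" are of this kind, and the variable names are ours.

```python
FAMILIES_SAMPLED = 95   # families with two or three representatives
DISJUNCT = 11           # families whose representatives did not group together
SENSU_LATO = 4          # disjunct families that taxonomic opinion would split


def proportion_paired(total, disjunct):
    """Fraction of families whose representatives grouped together."""
    return (total - disjunct) / total


# Counting every disjunct family as an error.
strict = proportion_paired(FAMILIES_SAMPLED, DISJUNCT)

# Assumption: the four sensu lato cases are not counted as errors.
lenient = proportion_paired(FAMILIES_SAMPLED, DISJUNCT - SENSU_LATO)
```

On these figures roughly 88% of families pair correctly under the strict count and roughly 93% under the lenient one.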

If our methods, while not perfect, are good at 
the level of placing taxa into families, is there any 
reason why they should not be equally acceptable 
at higher levels? We have investigated this with 
the assumption that the probability of errors will 
increase as internode lengths decrease. From each 
of the Group trees we have determined that the 
average length of internodes within families (re- 
stricting the measurements to families with only 
two correctly paired representatives) is 5.6 i.n.d., 
while the average length of internodes between 
families is 4.7. From the final tree showing the 
relationships of Groups (Fig. 38), the average length 
of internodes is 3.0. Thus, if our assumption is 
valid, the ratio 5.6: 4.7: 3.0 should reflect the re- 
liability of arranging species within families, fam- 
ilies within Groups, and Groups in the final tree. 
We suggest caution about accepting relationships 
as the taxonomic level increases. 
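To make the ratio easier to read, the three mean internode lengths can be scaled to the within-family value. This is only a restatement of the figures above under the stated assumption that reliability tracks internode length; the dictionary keys are our own labels.

```python
# Mean internode lengths in i.n.d., as reported in the text.
MEAN_INTERNODE_IND = {
    "species within families": 5.6,
    "families within Groups": 4.7,
    "Groups in the final tree": 3.0,
}


def relative_to_first(values):
    """Scale every value by the first entry of the mapping."""
    first = next(iter(values.values()))
    return {level: length / first for level, length in values.items()}


relative = relative_to_first(MEAN_INTERNODE_IND)
```

Scaled this way, internodes between families are about 0.84 of the within-family length and those between Groups about 0.54.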

There is no obvious reason why the ratio just 
reported should not be similar for other macro- 
molecular sequences. However, with nucleic acid 
sequencing (see review by Palmer, 1988) the 
amount of information available might increase by 
an order of magnitude over that presented here; 
thus, even if internode lengths at the highest levels 
are still proportionately small, the probability of 
errors due to chance when using small numbers 
should diminish and lead to more decisive phylog- 
enies. 


THE VALUE OF THIS STUDY 


We believe that the demarcation of plant taxa 
at all levels should be the prerogative of botanists 
with a broad background in taxonomy and that the 
same specialists are best suited to compare the 
results of this study, expressed as phylogenetic 
trees, with published phylogenies. Because we do 
not have that background, we resist the temptation 
to point out the similarities and differences that we 
perceive and to assess when our trees are likely to 
be incorrect. Our perceptions are likely to be un- 
balanced. 


One difference between this phylogenetic study 
and most others is that it is repeatable. Without 
detracting from the value of published angiosperm 
phylogenies, they do seem to depend on the ac- 
cumulated wisdom and experience of rare individ- 
uals whose relevant brain functions are not easily 
transmitted in entirety. On the other hand, anyone 
who follows our procedures should arrive at the 
same phylogenetic trees. More to the point, with 
improved analytical procedures it is possible that 
more acceptable endpoints may be reached. 

We have avoided the word “conclusions” be- 
cause we do not claim that this work is definitive. 
Rather it has led to new working hypotheses which, 
we hope, others will test with more extensive sam- 
pling and more data including much longer se- 
quences. To such investigators our analytical meth- 
od, whether perceived as successful or not, may 
be a useful example. 


NATURAL SELECTION AND THE 
EVOLUTION OF RUBISCO-SSU 


Under “General remarks about the sequences,” 
we discussed heterogeneity within species and quot- 
ed the evidence of Pichersky et al. (1986) that 
natural selection acts to keep the amino acid se- 
quence constant. Below we present other evidence 
for the importance of natural selection. 

Under “Methods of Data Analysis,” we dis- 
cussed Lake’s test, which is based only on trans- 
versions (mutations from a purine to a pyrimidine 
or vice versa) and ignores transitions (purine to 
purine or pyrimidine to pyrimidine). Lake (1987) 
quoted evidence (Brown et al., 1982) that in animal 
mitochondrial DNAs, transitions occur an order of 
magnitude more frequently than transversions. 
Zimmer et al. (1989) have found for higher plant 
cytoplasmic rRNA that, on average, transitions 
were twice as frequent as transversions with the 
lowest ratio in the most invariant regions. We have 
investigated this in 44 families of Groups 1 to 10 
and have scored those amino acid changes within 
families that can be ascribed unequivocally to trans- 
versions and transitions. There were 123 transi- 
tions and 306 transversions, a proportion of 0.287 
transitions and therefore quite different from the 
evidence just quoted. 

We have considered each of the 61 codons in
the genetic code and, assuming that each nucleotide
can change to another with the same probability,
calculated the frequencies of the four possibilities,
i.e., transition causing amino acid change,
transversion causing amino acid change, no amino
acid change, and lethality (stop). Thus, for the two
codons that determine phenylalanine the proportion
of amino-acid-changing substitutions that are
transitions is 0.25, for eight of the
amino acids it is 0.33, and the proportion varies from
0.14 to 0.34 with an average of 0.2845. Averaging
the 31 variable amino acids in the top line of Table
3 gives the proportion 0.268, which may be compared
with the observed figure of 0.287. This suggests
that, at the nonsilent positions, which are the only
ones we are able to consider, there is close to
randomness with respect to the occurrence of transitions
and transversions.
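The codon calculation can be sketched as follows. The sketch uses the standard genetic code, treats all single-nucleotide changes as equally likely, ignores silent changes and changes to stop codons, and reports transitions as a fraction of all amino-acid-changing substitutions for each amino acid. How the per-amino-acid values were then averaged is not specified in the text, so no attempt is made here to reproduce the averages; all names in the code are ours.

```python
from collections import defaultdict
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
PURINES = {"A", "G"}


def is_transition(old, new):
    """Purine to purine or pyrimidine to pyrimidine (old != new assumed)."""
    return (old in PURINES) == (new in PURINES)


def transition_fraction_by_amino_acid():
    """For each amino acid, the fraction of amino-acid-changing single
    substitutions (silent and stop-producing changes excluded) that are
    transitions."""
    counts = defaultdict(lambda: [0, 0])  # amino acid -> [transitions, transversions]
    for codon, aa in CODE.items():
        if aa == "*":
            continue  # only the 61 sense codons are considered
        for pos, new in product(range(3), BASES):
            if new == codon[pos]:
                continue
            new_aa = CODE[codon[:pos] + new + codon[pos + 1:]]
            if new_aa == aa or new_aa == "*":
                continue  # silent change or lethal stop
            counts[aa][0 if is_transition(codon[pos], new) else 1] += 1
    return {aa: ts / (ts + tv) for aa, (ts, tv) in counts.items()}


expected = transition_fraction_by_amino_acid()

# Observed within-family changes reported in the text.
observed = 123 / (123 + 306)
```

With these rules the sketch returns 0.25 for phenylalanine (`expected["F"]`), one third for methionine and one seventh for tryptophan, in line with the 0.25, 0.33, and 0.14 quoted above, while the observed counts give 0.287.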

We suggest that the large discrepancy between 
our result and the expectations from chemistry and 
nucleic acid sequencing is due partly to our inability 
to score silent substitutions and partly to the over- 
whelming importance of natural selection in de- 
termining the amino acid sequence of an important 
enzyme. Even though most nucleotide substitutions 
are presumably transitions, this has little effect on 
the final outcome, the amino acid sequence, on 
which natural selection can act. 

Other evidence of strong natural selection comes 
from consideration of variation at positions like 8 
and 9. At position 8, 84% of species have glycine 
and 15% asparagine. This substitution requires at 
least two nucleotide changes so, in the absence of 
selection, the single-change intermediates serine or 
lysine would often be expected, though they have 
not been observed. Similarly, at position 9, 56% 
of species have leucine and 28% lysine. Again, this 
is a two-nucleotide change, but the only single- 
change intermediates found are methionine and 
isoleucine, and these are much too rare to occur 
randomly. Apparently, glycine and asparagine are 
“adaptive peaks” at position 8 and leucine and 
lysine are at position 9. When positions 8 and 9 
are considered together, there is a small excess 
over chance expectations of the combinations gly- 
cine-leucine and asparagine-lysine; these may be 
adaptive peaks because both combinations are found 
within Tiliaceae (Group 9), Papilionaceae (Group 
14A), Apocynaceae (Group 20), Proteaceae (Group 
14A), and different families of Group 15. Clearly, 
convergent evolution has occurred. 
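The statement that these substitutions need at least two nucleotide changes can be checked against the standard genetic code. The sketch below compares every codon of one amino acid with every codon of the other and reports the smallest number of differing positions; the function names are ours.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def codons_for(amino_acid):
    """All codons that specify the given one-letter amino acid."""
    return [codon for codon, aa in CODE.items() if aa == amino_acid]


def min_nucleotide_changes(aa_from, aa_to):
    """Smallest number of differing positions between any pair of codons."""
    return min(
        sum(x != y for x, y in zip(c1, c2))
        for c1 in codons_for(aa_from)
        for c2 in codons_for(aa_to)
    )


gly_to_asn = min_nucleotide_changes("G", "N")  # position 8
leu_to_lys = min_nucleotide_changes("L", "K")  # position 9
val_to_ile = min_nucleotide_changes("V", "I")  # a single-step change, for contrast
```

Both the glycine-asparagine and the leucine-lysine substitutions come out at two changes, whereas valine-isoleucine needs only one.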

This last evidence suggests that adjacent positions
influence one another, which is known. Another
example is probably found in the Onagraceae,
the only family with N-terminal phenylalanine and,
alongside it, asparagine, again only found in Onagraceae.
Solanum species with the same rare
substitutions at positions 15 and 21 are examples
that the effect can extend further. Another example
concerns positions 30 and 39, both of which are
almost always either valine (V) or isoleucine (I).
The frequencies within species of the four possible
combinations (VV, VI, IV, II) indicate that the two
positions evolve independently; nevertheless, they
are different in the monocotyledons, with 67.5%
isoleucine, and the dicotyledons, with 35.4% isoleucine.
It is conceivable that monocotyledons are
richer in isoleucine because they have a more efficient
synthetic pathway for isoleucine so that, in
the absence of other strong selective forces, the
substitution of isoleucine for valine may be favored.

Despite the evidence that natural selection is 
acting strongly, there are few decisive changes, 
such as the change from proline to isoleucine at 
position 6 during the evolution of the monocoty- 
ledons. At positions 7 and 8, the combination ty- 
rosine-asparagine occurs in the gymnosperms, 
Groups 1, 2, and 3, but in no other Groups, sug- 
gesting that these amino acids are primitive. How- 
ever, the distinction between primitive and ad- 
vanced is usually equivocal; for the following 
example, normal taxonomic criteria have been used 
to distinguish 58 primitive genera (those in the 
gymnosperms, Piperaceae, Nelumbonaceae, and 
Groups 1, 2, 3, and 5) from 67 advanced genera 
(those in Asteraceae, Campanulaceae, Goodeni- 
aceae, Hydrophyllaceae, and Groups 10, 20, 21, 
22, 23, and 24). At position 12, tyrosine occurred 
in 5% of primitive and 31% of advanced genera 
while at position 20 aspartic acid occurred in 5% 
of primitive and 43% of advanced genera. While 
admitting that the sampling is not entirely satis- 
factory, it appears that tyrosine at position 12 and 
aspartic acid at position 20 are advanced. How- 
ever, the important point is that the divergence is 
so indecisive, the primitive amino acids phenylal- 
anine at position 12 and proline at position 20 still 
occurring in the majority of genera in all advanced 
Groups. 

If it is correct that natural selection acts strongly 
to determine the amino acid sequence of a protein, 
this could be important in considering “molecular 
evolutionary clocks.” If the clock that is considered 
is derived from nucleic acid sequences, the rare 
event that is the basis of regression of number of 


differences on time is nucleotide substitution, the
most common form of mutation and not always
subject to natural selection. If, however, the clock
is derived from amino acid sequences, the rare
event is not mutation alone but its incorporation
into the gene pool by natural selection. In both
cases clocklike behavior is observed, but the rare
event is different.


LITERATURE CITED


BERRY-LOWE, S., T. D. MCKNIGHT, D. M. SHAH & R. B.
MEAGHER. 1982. The nucleotide sequence, expression,
and evolution of one member of a multigene
family encoding the small subunit of ribulose-1,5-bisphosphate
carboxylase in soybean. J. Molec. Appl.
Genet. 1: 483-498.

BROWN, W. M., E. M. PRAGER, A. WANG & A. C. WILSON.
1982. Mitochondrial sequences of primates: tempo
and mode of evolution. J. Molec. Evol. 18: 225-239.

CARPENTER, J. M. 1988. Choosing among multiple
equally parsimonious cladograms. Cladistics 4: 291-296.

CHAPMAN, M. S., S. W. SUH, P. M. G. CURMI, D. CASCIO,
W. W. SMITH & D. S. EISENBERG. 1988. Tertiary
structure of plant RuBisCO: domains and their contacts.
Science 241: 71-74.

CRANE, P. R., M. J. DONOGHUE, J. A. DOYLE & E. M.
FRIIS. 1989. Angiosperm origins. Nature (London)
342: 131.

CRONQUIST, A. 1976. The taxonomic significance of the
structure of plant proteins; a classical taxonomist’s
view. Brittonia 28: 1-27.

CRONQUIST, A. 1981. An Integrated System of Classification
of Flowering Plants. Columbia Univ. Press, New York.

DAHLGREN, R. 1983. General aspects of angiosperm
evolution and macrosystematics. Nord. J. Bot. 3:
119-149.

FARRIS, J. S. 1988. HENNIG86 Reference: Version
1.5. James S. Farris, Port Jefferson Station, New
York.

GRUND, C., J. GILROY, T. GLEAVES, U. JENSEN & D.
BOULTER. 1981. Systematic relationships of the
Ranunculaceae based on amino acid sequence data.
Phytochemistry 20: 1559-1565.

HASLETT, B. G., A. YARWOOD, I. M. EVANS & D. BOULTER.
1976. Studies on the small subunit of fraction 1
protein from Pisum sativum L. and Vicia faba L.
Biochim. Biophys. Acta 420: 122-132.

HENDY, M. D. & D. PENNY. 1982. Branch and bound
algorithms to determine minimal evolutionary trees.
Math. Biosci. 59: 277-290.

HEYWOOD, V. H. 1978. Flowering Plants of the World.
Oxford Univ. Press, London.

JOHNSON, D. 1987. More approaches to the travelling
salesman guide. Nature (London) 330: 525.

KNIGHT, S., I. ANDERSON & C. I. BRANDEN. 1989. Reexamination
of the three-dimensional structure of the
small subunit of RuBisCO from higher plants. Science
244: 702-705.

LAKE, J. A. 1987. A rate-independent technique for
analysis of nucleic acid sequences: evolutionary parsimony.
Molec. Biol. Evol. 4: 167-191.

LAMB, C. & L. FITZMAURICE. 1986. Tailoring crop
improvement. Nature (London) 324: 414.

MARTIN, G. C., J. KEEN, B. V. FORD-LLOYD & H. J.
NEWBURY. 1987. Isolation and partial amino acid
sequencing of the small subunit of ribulose-1,5-bisphosphate
carboxylase/oxygenase from three species
of orchid. Pl. Sci. 50: 233-237.

MARTIN, P. G. 1979. Amino acid sequence of the small
subunit of ribulose-1,5-bisphosphate carboxylase from
spinach. Austral. J. Pl. Physiol. 6: 401-408.

MARTIN, P. G. & J. M. DOWD. 1984a. The study of plant
phylogeny using amino acid sequences of ribulose-1,5-bisphosphate
carboxylase. III Addition of Malvaceae
and Ranunculaceae to the phylogenetic tree.
Austral. J. Bot. 32: 283-290.

MARTIN, P. G. & J. M. DOWD. 1984b. The study of plant
phylogeny using amino acid sequences of ribulose-1,5-bisphosphate
carboxylase. IV Proteaceae and Fagaceae
and the rate of evolution of the small subunit.
Austral. J. Bot. 32: 291-299.

MARTIN, P. G. & J. M. DOWD. 1984c. The study of plant
phylogeny using amino acid sequences of ribulose-1,5-bisphosphate
carboxylase. V Magnoliaceae, Polygonaceae
and the concept of primitiveness. Austral. J.
Bot. 32: 301-309.

MARTIN, P. G. & J. M. DOWD. 1986a. Phylogenetic studies using
protein sequences within the order Myrtales. Ann.
Missouri Bot. Gard. 73: 442-448.

MARTIN, P. G. & J. M. DOWD. 1986b. A phylogenetic tree for
some monocotyledons and gymnosperms derived from
protein sequences. Taxon 35: 460-475.

MARTIN, P. G. & J. M. DOWD. 1986c. Is the gene for haemoglobin
widespread among dicotyledons? Pp. 81-82
in W. Wallace & S. E. Smith (editors), Proceedings
of the Eighth Australian Nitrogen Fixation Conference.
Australian Institute of Agricultural Science,
Parkville, Victoria.

MARTIN, P. G. & J. M. DOWD. 1988. A molecular evolutionary
clock for angiosperms. Taxon 37: 364-377.

MARTIN, P. G. & J. M. DOWD. 1989. Phylogeny among the flowering
plants as derived from amino acid sequence
data. Pp. 195-204 in B. Fernholm, K. Bremer &
H. Jornvall (editors), The Hierarchy of Life. Elsevier,
Amsterdam.

MARTIN, P. G. & J. M. DOWD. 1990. A protein sequence study
of the dicotyledons and its relevance to the evolution
of the legumes and nitrogen fixation. Austral. Syst.
Bot. 3: 91-100.

MARTIN, P. G. & A. C. JENNINGS. 1983. The study of plant
phylogeny using amino acid sequences of ribulose-1,5-biphosphate
carboxylase. I Biochemical methods
and the patterns of variability. Austral. J. Bot. 31:
395-409.

MARTIN, P. G., D. BOULTER & D. PENNY. 1985. Angiosperm
phylogeny studied using sequences of five macromolecules.
Taxon 34: 393-400.

MARTIN, P. G., J. M. DOWD & S. J. L. STONE. 1983. The
study of plant phylogeny using amino acid sequences
of ribulose-1,5-bisphosphate carboxylase. II The
analysis of small subunit data to form phylogenetic
trees. Austral. J. Bot. 31: 411-419.

MARTIN, P. G., J. M. DOWD, C. MORRIS & D. E. SYMON. 1986.
The study of plant phylogeny using amino acid sequences
of ribulose-1,5-bisphosphate carboxylase. VI
Solanum species from different continents. Austral.
J. Bot. 34: 187-195.

NAKANO, T., T. HASE & H. MATSUBARA. 1981. The
complete amino acid sequence of parsley (Petroselinum
sativum) ferredoxin. J. Biochem. 90: 1725-1730.

PALMER, J. D., R. K. JANSEN, H. J. MICHAELS, M. W.
CHASE & J. R. MANHART. 1988. Chloroplast DNA
variation and plant phylogeny. Ann. Missouri Bot.
Gard. 75: 1180-1206.

PENNY, D., M. D. HENDY & I. M. HENDERSON. 1987.
Reliability of evolutionary trees. Cold Springs Harbour
Symposium on Quantitative Biology 52: 857-862.

PICHERSKY, E., R. BERNATSKY, S. D. TANKSLEY & A. R.
CASHMORE. 1986. Evidence for selection as a
mechanism in the concerted evolution of Lycopersicon
esculentum (tomato) genes encoding the small
subunit of ribulose-1,5-bisphosphate carboxylase/oxygenase.
Proc. Natl. Acad. Sci. U.S.A. 83: 3880-3884.

RAMSHAW, J. A. M. 1982. Structures of plant proteins.
Pp. 229-290 in D. Boulter & B. Parthier (editors),
Nucleic Acids and Proteins in Plants 1. Encyclopaedia
of Plant Physiology, NS Volume 14A. Springer,
Berlin.

ROY, H., A. VALERI, D. H. POPE, L. RUCKERT & K. A.
COSTA. 1978. Small subunit contacts in ribulose-1,5-bisphosphate
carboxylase. Biochemistry 17: 665-668.

SCOGIN, R. 1981. Amino acid sequence studies and
plant phylogeny. Pp. 229-290 in D. A. Young &
D. S. Seidler (editors), Phytochemistry and Angiosperm
Phylogeny. Praeger, New York.

STROBAEK, S., G. C. GIBBONS, B. HASLETT, D. BOULTER
& S. G. WILDMAN. 1976. On the nature of the
polymorphism of the small subunit of ribulose-1,5-diphosphate
carboxylase in the amphidiploid Nicotiana
tabacum. Carlsberg Res. Commun. No. 41:
335-343.

TAKHTAJAN, A. 1983. The systematic arrangement of
dicotyledonous families. Pp. 180-201 in C. R. Metcalfe
& L. Chalk (editors), Anatomy of the Dicotyledons,
Vol. II (2nd Edition). Clarendon Press, Oxford.

THORNE, R. F. 1983. Proposed new realignments in
the angiosperms. Nord. J. Bot. 3: 85-117.

WEBER, W. A. 1982. Mnemonic three-letter acronyms
for the families of vascular plants: a device for more
effective herbarium curation. Taxon 31: 74-88.

WOLFE, K. H., M. GOUY, Y-W. YANG, P. M. SHARP &
W-H. LI. 1989. Date of the monocot-dicot divergence
estimated from chloroplast DNA sequence data.
Proc. Natl. Acad. Sci. U.S.A. 86: 6201-6205.

ZIMMER, E. A., R. K. HAMBY, M. L. ARNOLD, D. A.
LEBLANC & E. L. THERIOT. 1989. Ribosomal RNA
phylogenies and flowering plant evolution. Pp. 205-214
in B. Fernholm, K. Bremer & H. Jornvall (editors),
The Hierarchy of Life. Elsevier, Amsterdam.


